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SIM2 POLYPEPTIDES AND POLYNUCLEOTIDES AND USES OF EACH IN 
DIAGNOSIS AND TREATMENT OF OVARIAN, BREAST AND LUNG 

CANCERS 

FIELD AND BACKGROUND OF THE INVENTION 

The present invention relates to SIM2 polypeptides and polynucleotides and to 
methods of diagnosing and treating ovarian, breast and lung cancers. 

Lung cancer is the primary cause of cancer death among both men and women 
in the United States. The five-year survival rate among all lung cancer subjects, 
regardless of the stage of disease at diagnosis, is only 13%. This contrasts with a five- 
year survival rate of 46% among cases in which the disease is still localized. 
However, only 16% of lung cancers are diagnosed at a stage prior to spread of the 
disease. 

Early detection is difficult since clinical symptoms are often not observed until 
the disease has reached an advanced stage. Currently, diagnosis is aided by the use of 
chest x-rays, analysis of the type of cells contained in sputum and fiberoptic 
examination of the bronchial passages. Treatment regimens are determined by the 
type and stage of the cancer, and include surgery, radiation therapy and/or 
chemotherapy. In spite of considerable research into therapies for the disease, lung 
cancer remains difficult to treat. 

Breast cancer is the most common form of cancer in women, and can also be 
diagnosed in men. Over 200,000 new breast cancer cases are diagnosed each year in 
the United States. In the U.S. today, there are more than two million breast cancer 
survivors, and every woman is at risk. 

Molecular biomarkers for breast cancer are of several types. Risk biomarkers 
are those associated with increased cancer risk and include mammographic 
abnormalities, proliferative breast disease with or without atypia, family clustering and 
inherited germ-line abnormalities. Surrogate endpoint biomarkers are tissue, cellular or 
molecular alterations that occur between cancer initiation and progression. These 
biomarkers are utilized as endpoints in short-term chemoprevention trials. Prognostic 
biomarkers provide information regarding outcome irrespective of therapy, while 
predictive biomarkers provide information regarding response to therapy. Candidate 
prognostic biomarkers for breast cancer include elevated proliferation indices such as 
Ki-67 and proliferating cell nuclear antigen (PCNA); estrogen receptor (ER) and 
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progesterone receptor (PR) overexpression; markers of oncogene overexpression such 
as c-erbB-2, TGF-a and EGFR; indicators of apoptotic imbalance including 
overexpression of bcl-2 and an increased bax/bcl-2 ratio; markers of disordered cell 
signaling such as p53 nuclear protein accumulation; alteration of differentiation 
5 signals such as overexpression of c-myc and related proteins; loss of differentiation 
markers such as TGF-b II receptor and retinoic acid receptor; and alteration of 
angiogenesis proteins such as VEGF overexpression [Beenken (2002) Minerva Chir. 
57:437-48]. 

Availability of molecular biomarkers for breast cancer will substantially 
10 increase diagnostic capabilities and enable the development of prognostic indices, 

which combine the predictive power of individual molecular biomarkers with specific 

clinical and pathologic factors. 

Ovarian cancer, is a significant health problem for women in the United States 

and world wide. Although advances have been made in detection and therapy of this 
15 cancer, no vaccine or other universally successful method for prevention or treatment 

is currently available. Management of the disease currently relies on a combination of 

early diagnosis and aggressive treatment, which may include one or more of a variety 

of treatments such as surgery, radiotherapy, chemotherapy and hormone therapy The 

course of treatment for a particular cancer is often selected based on a variety of 
20 prognostic parameters, including an analysis of specific tumor markers. However, the 

use of established markers often leads to a result that is difficult to interpret, and high 

mortality continues to be observed in many cancer subjects. 

Thus, there remains a need for a practical method of diagnosing lung cancer, 

breast cancer and ovarian cancer as close to inception as possible. In order for early 
25 detection to be feasible, it is important that specific markers be found and their 

sequences elucidated. 

The Drosophila single minded (sim) gene is the master regulator of fruit fly 

meurogenesis [Thomas (1988); Mambu (1991)]. SIM protein is a transcription factor 

containing a basic helix-loop-helix (bHLH) motif, two PAS (PER/ARNT/SIM) 
30 domains, and an HST (HIF-a/SIM/TRH) domain [Mambu (1991); Isaac and Andrew 

(1996)]. 

Two mouse homologs of the sim gene (i.e., Siml and Sim2) were cloned. 
Siml maps on mouse chromosome 10 and Siml on mouse chromosome 16 in a region 
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of synteny withHC21 [Fan et al. (1996) Mol. Cell. Neurosci. 7:1-16]. Both mouse 
Siml and Sim2 genes are expressed early [Sim2 from embryonic day 8.0 (E8.0) and 
Siml from day 9.0 (E9.0)] in developing forebrain [Ema et al. (1996) Mol. Cell. Biol. 
16:5865-5875; Fan et al. (1996) supra; Moffett et al. (1996) Genomics 35:144-155; 
Yamaki et al. (1996) Genomics 35:136-143] and outside the central nervous system 
(CNS), in somites, mesonephric duct, and foregut (SIM1), in facial and trunk cartilage, 
trunk muscles (Sim2), and in the developing kidney (SIM1 and SIM2) [Dahmane et al. 
(1995) Proc. Natl. Acad. Sci. 92:9191-9195; Ema etal. (1996) supra; Fan et al. (1996) 
supra; Moffett et al. (1996) supra]. 

In adult mouse, both SIM1 and SIM2 are expressed in kidney and skeletal 
muscles, whereas Sim2 is also expressed in the lung (Ema et al. (1996) Biochem. 
Biophys. Res. Commun. 218:588-594; Moffett et al. (1996) supra]. 

Recently, the cloning of the cDNAs for two human homologs (SIM1 and 
SIM2) of the Drosophila sim gene, and mapping of SW1 to chromosome 6ql6.3-q21 
15 have been reported [Chrast (1997) Genome Research 7:615-624 and Chen et al. 
(1995)]. Northern blot analyses indicated the transcription of several mRNA 
transcripts from the SIM2 gene, including those of 2.7, 3, 4.4 and 6kb. The multiple 
mRNAs may be products of alternative splicing, overlapping transcription, or different 
utilization of 5' or 3' untranslated sequences. At least two different forms of the 
20 human SIM2 gene have been characterized. The long form (GenBank Accession No. 
U80456; SEQ ID NO: 7 is 3921 bp and codes for a protein of 667 amino acid with an 
apparent molecular weight of 74 kD. The short-form (GenBank Accession No. 
U80457; SEQ ID NO: 8) is 2859 bp and codes for a protein of 570 amino acid with an 
apparent molecular weight of 64 kD. 
25 Human SIM-1 and S1M-2, in combination with ARNT, attenuate transcription 

from the hypoxia-inducible erythropoietin (EPO) enhancer during hypoxia. SIM 
protein levels decrease with hypoxia treatment, suggesting a negative feedback 
mechanism. Upregulation and activation of HIF-la is concomitant with attenuation of 
SIM activities [Woods SL., Whitelaw ML., J. Biol. Chem 2002; 277:10236-43]. 
30 PCT Application No. WO 02/12565 discloses the use of SJM2 as a marker and 

possible therapeutic target for specific types of cancer. Increased expression of SIM2 
in specific cancers including colon, prostate and pancreas tumors as compared to 
normal tissues was observed. In accordance, a similar pattern of expression was found 
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by DeYoung and co-worlers [Proc. Natl. Acad. Sci. (2003) 100:4760-5 and Anticancer 
Res. (2002) 22(6A):3 149-57]. Interestingly, neither of the above-publications showed 
an elevated expression of SIM2 in ovarian, lung and breast tumors. 

While reducing the present invention to practice the present inventors 
5 uncovered that elevated levels of SIM2 are associated with ovarian, breast and lung 
tumors, thus showing for the first time that SIM2 can also be used as a marker and 
possible therapeutic target for ovarian, breast and lung tumors. 

SUMMARY OF THE INVENTION 

10 According to one aspect of the present invention there is provided a method of 

diagnosing predisposition to, or presence of ovarian cancer, breast cancer and/or lung 
cancer in a subject, the method comprising determining a level of SIM2 in a biological 
sample obtained from the subject, the level being correlatable with predisposition to, 
or presence or absence of the ovarian cancer, breast cancer and/or lung cancer, thereby 

15 diagnosing predisposition to, or presence of ovarian cancer, breast cancer and/or lung 
cancer in the subject. 

According to further features in preferred embodiments of the invention 
described below, the biological sample is a tissue sample and/or a body fluid sample. 

According to still further features in the described preferred embodiments ,the 

20 tissue sample is selected from the group consisting of an ovarian tissue, a lung tissue 
and a breast tissue. 

According to another aspect of the present invention there is provided a method 
of treating ovarian cancer, breast cancer and/or lung cancer in a subject, the method 
comprising downregulating expression or activity of SIM2 in a lung tissue, breast 
25 tissue and/or an ovarian tissue, thereby treating the ovarian cancer, breast cancer 
and/or lung cancer in the subject. 

According to still further features in the described preferred embodiments the 
SIM2 is selected from the group consisting of SEQ ID NOs: 1, 2, 3, 7, 8 and 9. 

According to still further features in the described preferred embodiments 
30 downregulating expression or activity of the SIM2 is effected by administering to the 
subject: 

(a) a molecule which binds SIM2; 

(b) an enzyme which cleaves SIM2; 
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(c) an antisense polynucleotide capable of specifically hybridizing with 
an mRNA transcript encoding SIM2; 

(d) a ribozyme which specifically cleaves S1M2 transcripts; 

(e) a non-functional analogue of at least a catalytic or binding portion of 
S1M2; 

(f) a molecule which prevents SIM2 activation or substrate binding; 

(g) an siRNA molecule capable of inducing degradation of S1M2 
transcripts; 

(h) a DNAzyme which specifically cleaves SIM2 transcripts or DNA; and 

(i) a molecule which promotes a SIM2-specific immunogenic response. 
According to still further features in the described preferred embodiments the 

molecule which binds SIM2 is an antibody or antibody fragment capable of 
specifically binding the SIM2. 

According to yet another aspect of the present invention there is provided use 
of an agent capable of downregulating SIM2 expression or activity for the treatment of 
ovarian, breast and/or lung cancer. 

According to still further features in the described preferred embodiments the 
agent capable of downregulating SIM2 activity is an antibody or antibody fragment. 

According to still further features in the described preferred embodiments the 
agent capable of downregulating SIM2 expression or activity is an oligonucleotide. 

According to still further features in the described preferred embodiments the 
oligonucleotide is a single or double stranded polynucleotide. 

According to still further features in the described preferred embodiments the 
oligonucleotide is at least 17 bases long. 

According to still further features in the described preferred embodiments the 
oligonucleotide is hybridizable in either sense or antisense orientation. 

According to still another aspect of the present invention there is provided use 
of a S3M2 detecting agent for detecting ovarian, breast and/or lung cancer. 

According to still further features in the described preferred embodiments the 
agent for detecting ovarian, breast and/or lung cancer is an oligonucleotide . 

According to still further features in the described preferred embodiments the 
agent for detecting ovarian, breast and/or lung cancer is an antibody or antibody 
fragment. 
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According to still further features in the described preferred embodiments the 
agent for detecting ovarian, breast and/or lung cancer is coupled to a detectable 
moiety selected from the group consisting of a chromogenic moiety, a fluorogenic 
moiety, a radioactive moiety and a light-emitting moiety. 
5 According to an additional aspect of the present invention there is provided an 

article-of-manufacture comprising a packaging material and a composition identified 
for treating ovarian, breast and/or lung cancer being contained within the packaging 
material, the composition including, as an active ingredient, an agent capable of 
downregulating SIM2 expression or activity. 
10 According to still an additional aspect of the present invention there is 

provided an isolated polynucleotide comprising a nucleic acid sequence encoding a 
polypeptide being at least 80 % homologous to SEQ ID NO: 39, 40 or 41 as 
determined using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where the gap creation equals 8 and gap 
15 extension penalty equals 2. 

According to still further features in the described preferred embodiments the 
polypeptide is as set forth in SEQ ID NO: 39, 40 or 41. 

According to a further aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence being 80 % identical to 
20 SEQ ID NO: 39, 40 or 41, as determined using the BestFit software of the Wisconsin 
sequence analysis package, utilizing the Smith and Waterman algorithm, where gap 
weight equals 50, length weight equals 3, average match equals 10 and average 
mismatch equals -9. 

According to still further features in the described preferred embodiments the 
25 nucleic acid sequence is as set forth in SEQ ID NO: 2 or 3. 

According to yet a further aspect of the present invention there is provided an 
isolated polynucleotide as set forth in SEQ ID NO: 2 or 3. 

According to still a fiirther aspect of the present invention there is provided a 
nucleic acid construct comprising the isolated polynucleotide. 
30 According to still a further aspect of the present invention there is provided an 

isolated polypeptide as set forth in SEQ ID NO: 39, 40 or 41. 

According to still a further aspect of the present invention there is provided a 
method of diagnosing predisposition to, or presence of cancer in a subject, the method 



WO 2004/012847 




PCT/IL2003/000636 



7 

comprising determining a level of SEQ ID NO: 2 and/or 3 in a biological sample 
obtained from the subject, wherein the biological sample is suspected of being a 
cancerous tissue or associated with the cancerous tissue and whereas the level being 
correlatable with predisposition to, or presence or absence of the cancer, thereby 
diagnosing predisposition to, or presence of cancer in the subject. 

According to still further features in the described preferred embodiments the 
determining level of the SEQ ID NO: 2 and/or 3 is effected at an mRNA level. 

According to still further features in the described preferred embodiments the 
determining level of the SEQ ID NO: 2 and/or 3 is effected at a protein level. 
5 According to still further features in the described preferred embodiments the 

determining level of the SEQ ID NO: 2 and/or 3 is effected at a gene amplification 
level. 

According to still a further aspect of the present invention there is provided a 
method of treating cancer in a subject, the method comprising downregulating 

10 expression or activity of SEQ ID NO: 2 and/or 3 in a cancerous tissue, thereby treating 
the cancer in the subject. 

The present invention successfully addresses the shortcomings of the presently 
known configurations by providing SIM2 polypeptides and polynucleotides encoding 
same which can be used to diagnose and treat ovarian and lung cancers. 

15 Unless otherwise defined, all technical and scientific terms used herein have 

the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. Although methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, suitable 
methods and materials are described below. In case of conflict, the patent 

20 specification, including definitions, will control. In addition, the materials, methods, 
and examples are illustrative only and not intended to be limiting. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is herein described, by way of example only, with reference to 
25 the accompanying drawings. With specific reference now to the drawings in detail, it 
is stressed that the particulars shown are by way of example and for purposes of 
illustrative discussion of the preferred embodiments of the present invention only, and 
are presented in the cause of providing what is believed to be the most useful and 
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readily understood description of the principles and conceptual aspects of the 
invention. In this regard, no attempt is made to show structural details of the invention 
in more detail than is necessary for a fundamental understanding of the invention, the 
description taken with the drawings making apparent to those skilled in the art how the 
5 several forms of the invention may be embodied in practice. 
In the drawings: 

FIG. 1 is a schematic illustration showing the genomic organization of SIM2 
expression products. Boxes designate exons and lines designate introns. Arrow heads 
designate primers. 

10 FIG. 2 is a histogram showing SIM2 expression in normal and tumor-derived 

lung samples as determined by real time PCR using SEQ ID NO: 20, normalized to the 
housekeeping gene RPS27A (SEQ ID NO: 23). Two independent experiments are 
shown. 

FIG. 3 is a histogram showing SIM2 expression in normal and tumor derived 
15 colon samples as determined by real time PCR using a SIM2 derived fragment (SEQ 
ID NO: 20). 

FIG. 4 is a histogram showing SIM2 long and short transcript expression in 
normal and tumor-derived lung samples as determined by real time PCR using a SIM2 
derived fragment (SEQ ID NO: 9) corresponding to coordinates 232- 404 of SEQ ID 
20 Nos 7 and 8. Expression of SIM-2 derived fragment (SEQ ID NO: 9) was normalized 
to the expression of RPS27A (SEQ ID NO: 23) housekeeping gene. Two independent 
experiments are shown. 

FIG. 5 is a histograms depicting SIM2 expression as in Figure 4 on a 0-200 

scale. 

25 FIG. 6 is a histogram showing SIM2 long and short transcript (SEQ ID NOs: 7 

and 8) expression in normal and tumor derived lung samples as determined by real 
time PCR using a SIM2 derived fragment (SEQ ID NO: 9). Expression of SEQ ID 
NO: 9 was normalized to the expression of RPS27A (SEQ ID NO: 23) housekeeping 
gene. 

30 FIG. 7 is a histogram depicting SIM2 expression as in Figure 6 on a 0-200 

scale. 

FIG. 8 is a histogram showing SIM2 expression in normal and tumor derived 
ovarian samples as determined by real time PCR using a SIM2 derived fragments 
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(SEQ ID NOs: 9 and 20). Expression was normalized to the averaged expression of 
three housekeeping genes PBGD (SEQ ID NO: 32), ATP-6-syn (SEQ ID NO: 26) and 
18s ribosomal RNA (SEQ ID NO: 29). 

FIG. 9 is a histogram showing S1M2 long variant expression in normal and 
5 tumor derived ovary samples as determined by real time PCR using a SIM2 derived 
fragment (SEQ ID NO: 18). Expression was normalized to the averaged expression of 
two housekeeping genes PBGD (SEQ ID NO: 32) and HPRT1 (SEQ ID NO: 35). 

FIG. 10 is a histogram showing SIM2 long variant expression in normal and 
tumor derived lung samples as determined by real time PCR using a SIM2 derived 
10 fragment (SEQ ID NO: 18). Expression was normalized to the averaged expression of 
three housekeeping genes SDHA (SEQ ID NO: 38), RPS27A (SEQ ID NO: 23) and 
PBGD (SEQ ID NO: 32). 

FIG. 1 1 is a histogram showing SIM2 short variant expression in normal and 
tumor derived ovarian samples as determined by real time PCR using a SIM2 derived 
15 fragment (SEQ ID NO: 19). Expression was normalized to the averaged expression of 
two housekeeping genes PBGD (SEQ ID NO: 32) and HPRT1 (SEQ ID NO: 35) . 

FIG. 12 is a photomicrograph showing the expression of SIM2 long variant in 
- normal and tumor derived breast samples as determined by RT-PCR using a SIM2 
derived fragment (SEQ ID NO: 18). 
20 FIG. 13 is a photomicrograph showing the expression of SIM2 short variant in 

normal and tumor derived breast samples as determined by RT-PCR using a SIM2 
derived fragment (SEQ ID NO: 19). 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
25 The present invention is of SIM2 polypeptides and polynucleotides encoding 

same, which can be used to diagnose and treat ovarian and lung cancers. 

The principles and operation of the present invention may be better understood 
with reference to the drawings and accompanying descriptions. 

Before explaining at least one embodiment of the invention in detail, it is to be 
30 understood that the invention is not limited in its application to the details set forth in 
the following description or exemplified by the Examples. The invention is capable of 
other embodiments or of being practiced or carried out in various ways. Also, it is to 
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be understood that the phraseology and terminology employed herein is for the 
purpose of description and should not be regarded as limiting. 

Lung cancer and ovarian cancer are deadly diseases for which treatment is 
currently limited. It is well established that early detection of primary, metastatic and 
5 recurrent diseases can significantly impact the prognosis of individuals suffering from 
lung cancer and ovarian cancer. 

Current methods for early detection of ovarian cancer involve transabdominal / 
transvaginal ultrsonography and a blood test for the tumor marker CA-125. While 
ultrasound screening is often misleading since most enlarged ovaries result from 
10 benign cysts, the use of the tumor marker CA-125 in diagnosing ovarian cancer is 
limited due to a high false-positive rate. 

Early detection of lung cancer is currently effected using chest X-rays and 
sputum cytology. Since these methods do not have any effect on mortality, detection 
still may be too late to affect the natural cause of the disease. 
15 The Drosophila single minded (SIM) gene is the master regulator of fruit fly 

meurogenesis [Thomas (1988); Mambu (1991)]. SIM protein is a transcription factor 
containing a basic helix-loop-helix (bHLH) motif, two PAS (PER/ARNT/SIM) 
domains, and an HST (HIF-a/SIM/TRH) domain [Mambu (1991); Isaac and Andrew 
(1996)]. The native human SIM2 gene has been cloned and multiple mRNA products 
20 of the gene have been found by Northern blot analyses. 

S1M2 has been previously associated with colon, pancreatic and prostate 
cancers but not with lung and ovarian cancers (PCT Application No. WO02/12565). 

While reducing the present invention to practice the present inventors 
uncovered, for the first time, that elevated levels of SIM2 are present in ovarian, breast 
25 and lung tumors, thus providing evidence that this gene can be utilized as a diagnostic 
marker for ovarian, breast and lung tumors, or can serve as a basis for a therapeutic 
agent for treating such tumors. 

Thus, according to one aspect of the present invention there is provided a 
method of diagnosing predisposition to, or presence of ovarian cancer, breast cancer 
30 and/or lung cancer in a subject. 

As used herein the term "predisposition" refers to a latent susceptibility to 
ovarian, breast and/or lung cancer, which may lead, under certain conditions, to the 
formation of ovarian, breast and/or lung cancer. 
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As used herein the phrase "ovarian cancer" refers to epithelial tumors (i.e., 
carcinomas) and non-epithelial tumors (e.g., stroma cell and germ cell tumors of the 
ovary). 

As used herein the phrase "breast cancer" refers to non-invasive and invasive 
5 tumors of the breast including Lobular Carcinoma In Situ (LCIS), Ductal carcinoma, 
Ductal Carcinoma In Situ (DCIS) and Carcinoma In Situ. 

As used herein the phrase "lung cancer" refers to cancers of the lung including 
small cell lung cancer and non-small cell lung cancer. 

The method, according to this aspect of the present invention is effected by 
10 determining a level of SIM2 in a biological sample which is obtained from the subject 
thereby diagnosing predisposition to, or presence of ovarian cancer, breast cancer 
and/or lung cancer in the subject. 

As used herein "a biological sample" " refers to a sample of tissue (e.g., breast, 
lung, ovary) or fluid isolated from a subject, including but not limited to, for example, 
15 plasma, serum, spinal fluid, lymph fluid, the external sections of the skin, respiratory, 
intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and 
also samples of in vivo cell culture constituents. 

As used herein, the term "level" refers to expression of SIM2 RNA and/or 
protein or SIM2 DNA copy number. 
20 As is further described hereinbelow and in the examples section which follows, 

the present inventors have shown that levels of SIM2 beyond those found in normal 
tissue correlate with predisposition to, or presence or absence of the ovarian cancer, 
breast cancer and/or lung cancer providing evidence that this gene can serve as a 
marker for breast cancer, lung cancer and ovarian cancer. 
25 As used herein SIM2 refers to a SIM2 gene as set forth in sequence coordinates 

34649835 - 34700062 of chromosome 21q22.13, expression products of the SIM2 
gene as well as, fragments and variants thereof. 

As used herein the term "variants" refers to splice variants and allelic variants 
ofSIM2. 

30 The phrase "splice variant* ' refers to alternative forms of RNA transcribed from 

a SIM2 gene. Splice variation arises naturally through use of alternative splicing sites 
within a transcribed RNA molecule, or less commonly between separately transcribed 
RNA molecules, and may result in several mRNAs transcribed from the same gene. 
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Splice variants may encode polypeptides having altered amino acid sequence. The 
term splice variant is also used herein to denote a polypeptide encoded by a splice 
variant of an mKNA transcribed from a gene. 

The phrase "allelic variant" refers to two or more alternative forms of a SIM2 
gene occupying the same chromosomal locus. Allelic variation arises naturally 
through mutation, and may result in phenotypic polymorphism within populations. 
Gene mutations can be silent (no change in the encoded polypeptide) or may encode 
polypeptides having altered amino acid sequence. The term allelic variant is also used 
herein to denote a protein encoded by an allelic variant of a gene. 

Examples of S1M2 expression products (i.e., splice variants) are set forth in 
SEQ ID NOs: 7 (GenBank Accession No. U80456) and 8 (GenBank Accession No. 
U80457), which correspond to the long form and short form of SIM2, respectively. 

It will be appreciated that while reducing the present invention to practice 
additional SIM2 splice variant have been discovered by the present inventors. These 
are set forth in SEQ ID NOs: 2 and 3, further description of which is provided 
hereinbelow and in Figure 1 and Table 1, below. 

Numerous well known tissue or fluid collection (e.g., sputum collection) 
methods can be utilized to collect the biological sample from the subject in order to 
determine the level of SIM2 DNA, RNA and/or polypeptide of the subject. 

Tissue biopsy can be utilized to collect a tissue sample from lung tissue, breast 
tissue and/or ovarian tissue. Methods of performing lung, breast and ovarian biopsies 

are well known in the art. 

Typically, an ovarian biopsy is effected using fine-needle aspiration (FNA), 
which is an important diagnostic tool in gynecology. Its main role is in diagnosis of 
advanced and recurrent gynecologic malignancies. The technique uses a small-gauge 
needle to aspirate a lesion for cytologic analysis, sometimes with the aid of 

radiographic imaging. 

A number of approaches for performing breast biopsies are known in the art. 

A Stereotactic Needle Biopsy is a relatively novel approach, which is 
employed when the physician cannot feel the lump that was found on a mammogram. 
A Stereotactic Biopsy uses mammographic images and computer technology and 
combines them to determine the exact location of an abnormality to obtain a sample of 
breast tissue. In this way, non-surgical techniques are used to determine whether an 
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abnormality is cancerous. This procedure is equally accurate as surgical biopsy 
without the scar and anesthetic risks of surgery. The patient lies facedown on a special 
table with an opening for the breast. Using special equipment, the breast is compressed 
similar to a mammogram but with less compression, and then x-rayed from several 
5 angles. The data is entered into a computer, and then a radiologist inserts a biopsy 

needle into the lump. 

fine Needle Aspiration involves inserting a very fine, hollow needle into a cyst 

to remove some fluid or tissue. 

Core Needle Biopsy utilizes a larger needle inserted into the breast mass to 
10 remove a tissue for examination. 

Ductal Lavage is a relatively new, minimally invasive FDA approved 
procedure being used for women who are considered at high risk of developing breast 
cancer. The doctor inserts a catheter into a milk duct and withdraws a sampling of 
cells. 

15 Lung biopsies can be performed using a variety of techniques. A bronchoscopy 

is preferably effected to retrieve lung tissues which are located deep in the chest. If 
the area lies close to the chest wall, a needle biopsy is often done. If both these 
methods fail, an open surgical biopsy may be carried out. If there are indications that 
the lung cancer has spread to the lymph nodes in the mediastinum, a mediastinoscopy 

20 is performed. 

When a needle biopsy is to be done, the subject will be given a sedative about 
an hour before the procedure, to help relaxation. The subject sits in a chair with arms 
folded on a table in front. X rays are then taken to identify the location of the 
suspicious areas. Small metal markers are placed on the overlying skin to mark the 
25 biopsy site. The skin is thoroughly cleansed with an antiseptic solution, and a local 
anesthetic is injected to numb the area. 

A small cut (incision) about half an inch in length is then being made. The 
subject is asked to take a deep breath and hold it while a special biopsy needle is 
inserted through the incision into the lung. When enough tissue has been obtained, the 
30 needle is withdrawn. Pressure is applied at the biopsy site and a sterile bandage is 
placed over the cut. The entire procedure takes between 30 and 45 minutes. 

The subject may feel a brief sharp pain or some pressure as the biopsy needle is 
inserted. Most subjects, however, do not experience severe pain. 
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Open biopsies are performed in a hospital under general anesthesia As with 
needle biopsies, subjects are given sedatives before the procedure. An intravenous 
line is placed in the arm to give medications or fluids as necessary. A hollow 
endotracheal tube, is passed through the throat, into the airway leading to the lungs. It 
5 is used to convey the general anesthetic. 

Once the subject is under anesthesia, an incision over the lung area is made. 
Some lung tissue is removed and the cut closed with stitches. The entire procedure 
takes about an hour. A chest tube is sometimes placed with one end inside the lung and 
the other end protruding through the closed incision. Chest tube placement is done to 
10 prevent the lungs from collapsing by removing the air from the lungs. The tube is 
removed a few days following the biopsy. A chest X ray is done following an open 
biopsy, to check for lung collapse. 

The preparation for a mediastinoscopy is similar to that for an open biopsy. 
The subject is sedated and prepared for general anesthesia. The neck and the chest 
15 will be cleansed with an antiseptic solution. Once the subject is under anesthesia, an 
incision of about two or three inches long is made at the base of the neck. A thin, 
hollow, lighted mediastinoscope is inserted through the cut into the space between the 
right and the left lungs. The space is examined thoroughly and any lymph nodes or 
tissues that look abnormal are removed. The mediastinoscope is then removed, and the 
20 incision stitched up and bandaged. 

Regardless of the procedure employed, once a biopsy is obtained the level of 
SIM2 can be determined and a diagnosis can thus be made. 

Determining a level of SIM2 can be effected using various biochemical and 
molecular approaches used in the art for determining gene amplification, and/or level 
25 of gene expression. 

It will be appreciated that since SIM2 is expressed in normal lung, breast and 
ovarian tissues at low levels, detection of SIM2 in a normal tissue is preferably 
effected along side to detect an elevated expression and/or amplification. Samples 
used to determine the normal range of SIM2 can be normal samples from individuals 
30 not suffering from the disease condition. 

Typically, detection of a nucleic acid of interest in a biological sample is 
effected by hybridization-based assays using an oligonucleotide probe. 
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The term "oligonucleotide" refers to a single stranded or double stranded 
oligomer or polymer of ribonucleic acid (RNA) or deoxyribonucleic acid (DNA) or 
mimetics thereof. This term includes oligonucleotides composed of naturally- 
occurring bases, sugars and covalent internucleoside linkages (e.g., backbone) as well 
5 as oligonucleotides having non-naturally-occurring portions which function similarly 
to respective naturaUy-occurring portions. An example of an oligonucleotide probe 
which can be utilized by the present invention is a single stranded polynucleotide 
which includes a sequence complementary to the sequence region encompassed by 
SEQIDNO:9or 1. 

10 Oligonucleotides designed according to the teachings of the present invention 

can be generated according to any oligonucleotide synthesis method known in the art 
such as enzymatic synthesis or solid phase synthesis. Equipment and reagents for 
executing solid-phase synthesis are commercially available from, for example, 
Applied Biosystems. Any other means for such synthesis may also be employed; the 

15 actual synthesis of the oligonucleotides is well within the capabilities of one skilled in 
the art and can be accomplished via established methodologies as detailed in, for 
example, 'Molecular Cloning: A laboratory Manual" Sambrook et al., (1989); 
"Current Protocols in Molecular Biology" Volumes I-m Ausubel, TL M., ed. (1994); 
Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and Sons, 

20 Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", John 
Wiley & Sons, New York (1988) and "Oligonucleotide Synthesis" Gait, M. J., ed. 
(1984) utilizing solid phase chemistry, e.g. cyanoethyl phosphoramidite followed by 
deprotection, desalting and purification by for example, an automated trityl-on method 
orHPLC. 

25 The oligonucleotide of the present invention is of at least 17, at least 18, at 

least 19, at least 20, at least 22, at least 25, at least 30 or at least 40, bases specifically 
hybridizable with SIM2 derived sequence (e.g., SEQ ID NO: 1 which are hybridizable 
with the primers set forth in SEQ ID NOs. 4 and 5). 

The oligonucleotides of the present invention may comprise heterocylic 
30 nucleosides consisting of purines and the pyrimidines bases, bonded in a 3 ! to 5 ! 
phosphodiester linkage. 

Preferably used oligonucleotides are those modified in either backbone, 
internucleoside linkages or bases, as is broadly described hereinunder. 



WO 2004/012847 




PCT/EL2003/000636 



16 

Specific examples of preferred oligonucleotides useful according to this aspect 
of the present invention include oligonucleotides containing modified backbones or 
non-natural internucleoside linkages. Oligonucleotides having modified backbones 
include those that retain a phosphorus atom in the backbone, as disclosed in U.S. Pat. 

5 NOs: 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019; 
5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233; 5,466, 
677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 5,571,799; 
5,587,361; and 5,625,050. 

Preferred modified oligonucleotide backbones include, for example, 

10 phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters, 
aminoalkyl phosphotriesters, methyl and other alkyl phosphonates including 3- 
alkylene phosphonates and chiral phosphonates, phosphinates, phosphoramidates 
including 3 ! -amino phosphoramidate and aminoalkylphosphoramidates, 
thionophosphoramidates, thionoalkylphosphonates, thionoalkylphosphotriesters, and 

15 boranophosphates having normal 3-5' linkages, 2 ! -5' linked analogs of these, and those 
having inverted polarity wherein the adjacent pairs of nucleoside units are linked 3-5' 
to 5 f -3 f or 2 f -5 ! to 5'-2\ Various salts, mixed salts and free acid forms can also be used. 

Alternatively, modified oligonucleotide backbones that do not include a 
phosphorus atom therein have backbones that are formed by short chain alkyl or 

20 cycloalkyl internucleoside linkages, mixed heteroatom and alkyl or cycloalkyl 
internucleoside linkages, or one or more short chain heteroatomic or heterocyclic 
internucleoside linkages. These include those having moipholino linkages (formed in 
part from the sugar portion of a nucleoside); siloxane backbones; sulfide, sulfoxide 
and sulfone backbones; formacetyl and thioformacetyl backbones; methylene 

25 formacetyl and thioformacetyl backbones; alkene containing backbones; sulfamate 
backbones; methyleneimino and methylenehydrazino backbones; sulfonate and 
sulfonamide backbones; amide backbones; and others having mixed N, O, S and CH 2 
component parts, as disclosed in U.S. Pat. Nos. 5,034,506; 5,166,315; 5,185,444; 
5,214,134; 5,216,141; 5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 

30 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 5,596,086; 5,602,240; 
5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623, 070; 5,663,312; 
5,633,360; 5,677,437; and 5,677,439. 
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Other oligonucleotides which can he used according to the present invention, 
are those modified in both sugar and the internucleoside linkage, i.e., the backbone, of 
the nucleotide units are replaced with novel groups. The base units are maintained for 
complementation with the appropriate polynucleotide target An example for such an 
5 oligonucleotide mimetic, includes peptide nucleic acid (PNA). A PNA 
oligonucleotide refers to an oligonucleotide where the sugar-backbone is replaced with 
an amide containing backbone, in particular an aminoethylglycine backbone. The 
bases are retained and are bound directly or indirectly to aza nitrogen atoms of the 
amide portion of the backbone. United States patents that teach the preparation of 
10 PNA compounds include, but are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; 
and 5,719,262, each of which is herein incorporated by reference. Other backbone 
modifications, which can be used in the present invention are disclosed in U.S. Pat. 
No: 6,303,374. 

Oligonucleotides of the present invention may also include base modifications 

15 or substitutions. As used herein, "unmodified" or "natural" bases include the purine 
bases adenine (A) and guanine (G), and the pyrimidine bases thymine (T), cytosine (C) 
and uracil (U). Modified bases include but are not limited to other synthetic and 
natural bases such as 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine, xanthine, 
hypoxanthine, 2-aminoadenine, 6-methyl and other alkyl derivatives of adenine and 

20 guanine, 2-propyl and other alkyl derivatives of adenine and guanine, 2-thiouracil, 2- 
thiolhymine and 2-thiocytosine, 5-halouracil and cytosine, 5-propynyl uracil and 
cytosine, 6-azo uracil, cytosine and thymine, 5-uracil (pseudouracil), 4-thiouracil, 8- 
halo, 8-amino, 8-thiol, 8-thioalkyl, 8-hydroxyl and other 8-substituted adenines and 
guanines, 5-halo particularly 5-bromo, 5-trifluoromethyl and other 5-substituted 

25 uracils and cytosines, 7-methylguanine and 7-methyladenine, 8-azaguanine and 8- 
azaadenine, 7-deazaguanine and 7-deazaadenine and 3-deazaguanine and 3- 
deazaadenine. Further bases include those disclosed in U.S. Pat. No: 3,687,808, those 
disclosed in The Concise Encyclopedia Of Polymer Science And Engineering, pages 
858-859, Kroschwitz, J. L, ed. John Wiley & Sons, 1990, those disclosed by Englisch 

30 et al., Angewandte Chemie, International Edition, 1991, 30, 613, and those disclosed 
by Sanghvi, Y. S., Chapter 15, Antisense Research and Applications, pages 289-302, 
Crooke, S. T. and Lebleu, B. , ed., CRC Press, 1993. Such bases are particularly 
useful for increasing the binding affinity of the oligomeric compounds of the 



WO 2004/012847 




PCT/IL2003/000636 



18 

invention. These include 5 -substituted pyrimidines, 6-azapyrimidines and N-2, N-6 
and 0-6 substituted purines, including 2-aminopropyladenine, 5-propynyluracil and 5- 
propynylcytosine. 5-methylcytosine substitutions have been shown to increase nucleic 
acid duplex stability by 0.6-1.2°C. [Sanghvi YS et al. (1993) Antisense Research and 
5 Applications, CRC Press, Boca Raton 276-278] and are presently preferred base 
substitutions, even more particularly when combined with 2-O-methoxyethyl sugar 
modifications. 

It will be appreciated that olignoculeotides of the present invention may 
include further modifications which increase bioavailability, therapeutic efficacy and 

10 reduce cytotoxicity. Such modifications are described in Younes (2002) Current 
Pharmaceutical Design 8:1451-1466. 

Hybridization based assays which allow the detection of SIM2 (i.e., DNA or 
RNA) in a biological sample rely on the use of oligonucleotide which can be 10, 15, 
20, or 30 to 100 nucleotides long preferably from 10 to 50, more preferably from 40 to 

15 50 nucleotides. 

Hybridization of short nucleic acids (below 200 bp in length, e.g. 17-40 bp in 
length) can be effected using the following examplery hybridization protocols which 
can be modified according to the desired stringency; (i) hybridization solution of 6 x 
SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA 

20 (pH 7.6), 0.5 % SDS, 100 jig/ml denatured salmon sperm DNA and 0. 1 % nonfat dried 
milk, hybridization temperature of 1 - 1 .5 °C below the Tm, final wash solution of 3 M 
TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 
1.5 °C below the T m ; (ii) hybridization solution of 6 x SSC and 0.1 % SDS or 3 M 
TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 

25 jag/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization 
temperature of 2 - 2.5 °C below the T m > final wash solution of 3 M TMACI, 0.01 M 
sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 1.5 °C below the 
Tm, final wash solution of 6 x SSC, and final wash at 22 °C; (Hi) hybridization 
solution of 6 x SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 

30 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 ng/ml denatured salmon sperm DNA and 0.1 
% nonfat dried milk, hybridization temperature. 
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The detection of hybrid duplexes can be carried out by a number of methods. 
Typically, hybridization duplexes are separated from unhybridized nucleic acids and 
the labels bound to the duplexes are then detected. Such labels refer to radioactive, 
fluorescent, biological or enzymatic tags or labels of standard use in the art. A label 

5 can be conjugated to either the oligonucleotide probes or the nucleic acids derived 
from the biological sample (target). 

For example, oligonucleotides of the present invention can be labeled 
subsequent to synthesis, by incorporating biotinylated dNTPs or rNTP, or some similar 
means (e.g., photo-cross-hnking a psoralen derivative of biotin to RNAs), followed by 

10 addition of labeled streptavidin (e.g., phycoerythrin-conjugated streptavidin) or the 
equivalent. Alternatively, when fluorescently-labeled oligonucleotide probes are 
used, fluorescein, lissamine, phycoerythrin, rhodamine (Perkin Elmer Cetus), Cy2, 
Cy3, Cy3.5, Cy5, Cy5.5, Cy7, FluorX (Amersham) and others [e.g., Kricka et al. 
(1992), Academic Press San Diego, Calif] can be attached to the oligonucleotides. 

15 Traditional hybridization assays include PCR, RT-PCR, RNase protection, in- 

situ hybridization, primer extension, Southern blot, Northern Blot and dot blot 
analysis. 

Those skilled in the art will appreciate that wash steps may be employed to 
wash away excess target DNA or probe as well as unbound conjugate. Further, 
20 standard heterogeneous assay formats are suitable for detecting the hybrids using the 
labels present on the oligonucleotide primers and probes. 

It will be appreciated that a variety of controls may be usefully employed to 
improve accuracy of hybridization assays. For instance, samples may be hybridized to 
an irrelevant probe and treated with RNAse A prior to hybridization, to assess false 
25 hybridization. 

Specifically, gene amplification may be measured directly by DNA analysis 
such as Southern blot or dot blot techniques. For Southern blotting, DNA is extracted 
using methods which are well known in the art, involving tissue mincing, cell lysis, 
protein extraction and DNA precipitation using 2 to 3 volumes of 100% ethanol, 
30 rinsing in 70% ethanol, pelleting, drying and resuspension in water or any other 
suitable buffer (e.g., Tris-EDTA). Preferably, following such procedure, DNA 
concentration is determined such as by measuring the optical density (OD) of the 
sample at 260 nm (wherein 1 unit OD=50 jig/ml DNA). 
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To determine the presence of proteins in the DNA solution, the OD 260/OD 
280 ratio is determined. Preferably, only DNA preparations having an OD 260/OD 
280 ratio between 1.8 and 2 are used in the following procedures described 
hereinbelow. 

5 The purified DNA is then digested with one or more restriction enzymes, and 

the resulting fragments separated on an agarose gel by electrophoresis. The DNA 
fragments are then transferred to a nylon or cellulose nitrate filter by blotting, and the 
DNA fixed by baking. The filter is then exposed to a labeled complementary probe 
and the regions of hybridization detected, usually by autoradiography. Dot blotting is 

10 similar, except that the DNA fragments are not separated on the gel. The degree of 
gene amplification is then determined by dilutional analysis or densitometry scanning. 

Polymerase chain reaction (PCR)-based methods may be used to identify the 
presence of SIM2 mRNA. For PCR-based methods a pair of oligonucleotides is used, 
which is specifically hybridizable with the SIM2 polynucleotide sequences described 

15 hereinabove in an opposite orientation so as to direct exponential amplification of a 
portion thereof (including the hereinabove described sequence alteration) in a nucleic 
acid amplification reaction. For example, an oligonucleotide pair of primers 
specifically hybridizable with SIM2 are set forth in SEQ ID NOs: 10 and 11 which 
provide an amplification product which corresponds to SEQ ID NO: 9. 

20 The polymerase chain reaction and other nucleic acid amplification reactions 

are well known in the art and require no further description herein. The pair of 
oligonucleotides according to this aspect of the present invention are preferably 
selected to have compatible melting temperatures (Tm), e.g., melting temperatures 
which differ by less than that 7 °C, preferably less than 5 °C, more preferably less than 

25 4 °C, most preferably less than 3 °C, ideally between 3 °C and 0 °C. 

Hybridization to oligonucleotide arrays may be also used to determine SIM2 
expression. Such screening has been undertaken in the BRCA1 gene and in the 
protease gene of HIV-1 virus [see Hacia et aL f (1996) Nat Genet 1996; 1 4(4) :44 1-447; 
Shoemaker et al., (1996) Nat Genet 1996;14(4):450-456; Kozal et aL, (1996) Nat Med 

30 1996;2(7):753-759], 

The nucleic acid sample which includes the candidate region to be analyzed is 
isolated, amplified and labeled with a reporter group. This reporter group can be a 
fluorescent group such as phycoerythrin. The labeled nucleic acid is then incubated 
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with the probes immobilized on the chip using a fluidics station. For example, Manz 
et al. (1993) Adv in Chromatogr 1993; 33:1-66 describe the fabrication of fluidics 
devices and particularly microcapillary devices, in silicon and glass substrates. 

Once the reaction is completed, the chip is inserted into a scanner and patterns 
5 of hybridization are detected. The hybridization data is collected, as a signal emitted 
from the reporter groups already incorporated into the nucleic acid, which is now 
bound to the probes attached to the chip. Since the sequence and position of each 
probe immobilized on the chip is known, the identity of the nucleic acid hybridized to 
a given probe can be determined. 

10 It will be appreciated that when utilized along with automated equipment, the 

above described detection methods can be used to screen multiple samples for ovarian, 
breast or lung cancers both rapidly and easily. 

The presence of SIM2 in an ovarian, breast and/or lung tissue can also be 
determined at the protein level. Numerous protein detection assays are known in the 

15 art, examples include but are not limited to chromatography, electrophoresis, 
immunodetection assays such as ELISA and western blot analysis, 
immunohistochemistry and the like, which may be effected using antibodies specific 
to SIM2. Thus, the present invention envisages the use of serum immunoglobulins, 
polyclonal antibodies or fragments thereof, (i.e., immunoreactive derivatives thereof), 

20 or monoclonal antibodies or fragments thereof for the detection of ovarian, breast 
and/or lung cancer. Monoclonal antibodies or purified fragments of the monoclonal 
antibodies having at least a portion of an antigen-binding region, including the 
fragments described hereinbelow, chimeric or humanized antibodies and 
complementarily determining regions (CDR). 

25 The term "antibody 11 refers to whole antibody molecules as well as functional 

fragments thereof, such as Fab, F(ab , ) 2 , and Fv that are capable of binding with 
antigenic portions of the target polypeptide. These functional antibody fragments 
constitute preferred embodiments of the present invention, and are defined as follows: 
(1) Fab, the fragment which contains a monovalent antigen-binding fragment 

30 of an antibody molecule, can be produced by digestion of whole antibody with the 
enzyme papain to yield an intact light chain and a portion of one heavy chain; 
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(2) Fab\ the fragment of an antibody molecule that can be obtained by treating 
whole antibody with pepsin, followed by reduction, to yield an intact light chain and a 
portion of the heavy chain; two Fab 1 fragments are obtained per antibody molecule; 

(3) (Fab ! ) 2 , the fragment of the antibody that can be obtained by treating whole 
5 antibody with the enzyme pepsin without subsequent reduction; F(ab')2 is a dimer of 

two Fab' fragments held together by two disulfide bonds; 

(4) Fv, defined as a genetically engineered fragment containing the variable 
region of the light chain and the variable region of the heavy chain expressed as two 
chains; and 

10 (5) Single chain antibody ("SCA"), a genetically engineered molecule 

containing the variable region of the light chain and the variable region of the heavy 
chain, linked by a suitable polypeptide linker as a genetically fused single chain 
molecule as described in, for example, U.S. Patent 4,946,778. 

SIM2-specific antibodies can be commercially obtained from Santa Cruz 

15 [SM21 (C-17):sc-8716; SIM2s (C-15):sc-8715]. 

Alternatively, S1M2 specific antibodies may be generated using methods, 
which are well known in the art. See for example, Harlow and Lane, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory, New York, 1988, incorporated 
herein by reference). 

20 Purification of serum immunoglobulin antibodies (polyclonal antisera) or 

reactive portions thereof can be accomplished by a variety of methods known to those 
of skill including, precipitation by ammonium sulfate or sodium sulfate followed by 
dialysis against saline, ion exchange chromatography, affinity or immunoaffinity 
chromatography as well as gel filtration, zone electrophoresis, etc. (see Goding in, 

25 Monoclonal Antibodies: Principles and Practice, 2nd ed., pp. 104-126, 1986, Orlando, 
Fla., Academic Press). Under normal physiological conditions antibodies are found in 
plasma and other body fluids and in the membrane of certain cells and are produced by 
lymphocytes of the type denoted B cells or their functional equivalent. Antibodies of 
the IgG class are made up of four polypeptide chains linked together by disulfide 

30 bonds. The four chains of intact IgG molecules are two identical heavy chains referred 
to as H-chains and two identical light chains referred to as L-chains. Additional 
classes include IgD, IgE, IgA, IgM and related proteins. 
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Methods of generating and isolating monoclonal antibodies are well known in 
the art, as summarized for example in reviews such as Tramontano and Schloeder, 
Methods in Enzymology 178, 551-568, 1989. A recombinant SIM2 polypeptide may 
be used to generate antibodies in vitro. More preferably, the recombinant is used to 
5 elicit antibodies in vivo. In general, a suitable host animal is immunized with the 
recombinant SIM2. Advantageously, the animal host used is a mouse of an inbred 
strain. Animals are typically immunized with a mixture comprising a solution of the 
recombinant SIM2 in a physiologically acceptable vehicle, and any suitable adjuvant, 
which achieves an enhanced immune response to the immunogen. By way of 
10 example, the primary immunization conveniently may be accomplished with a mixture 
of a solution of the recombinant SIM2 and Freund's complete adjuvant, said mixture 
being prepared in the form of a water in oil emulsion. Typically the immunization will 
be administered to the animals intramuscularly, intradermally, subcutaneously, 
intraperitoneally, into the footpads, or by any appropriate route of administration. The 
15 immunization schedule of the immunogen may be adapted as required, but customarily 
involves several subsequent or secondary immunizations using a milder adjuvant such 
as Freund's incomplete adjuvant. Antibody titers and specificity of binding to the 
SIM2 can be determined during the immunization schedule by any convenient method 
including by way of example radioimmunoassay, or enzyme linked immunosorbant 
assay, which is known as the ELISA assay. When suitable antibody titers are 
achieved, antibody-producing lymphocytes from the immunized animals are obtained, 
and these are cultured, selected and cloned, as is known in the art. Typically, 
lymphocytes may be obtained in large numbers from the spleens of immunized 
animals, but they may also be retrieved from the circulation, the lymph nodes or other 
25 lymphoid organs. Lymphocytes are then fused with any suitable myeloma cell line, to 
yield hybridomas, as is well known in the art. Alternatively, lymphocytes may also be 
stimulated to grow in culture, and may be immortalized by methods known in the art 
including the exposure of these lymphocytes to a virus, a chemical or a nucleic acid 
such as an oncogene, according to established protocols. After fusion, the hybridomas 
are cultured under suitable culture conditions, for example in multi-well plates, and the 
culture supematants are screened to identify cultures containing antibodies that 
recognize the hapten of choice. Hybridomas that secrete antibodies that recognize the 
recombinant SIM2 are cloned by limiting dilution and expanded, under appropriate 
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culture conditions. Monoclonal antibodies are purified and characterized in terms of 
immunoglobulin type and binding affinity. 

Antibody fragments according to the present invention can be prepared by 
proteolytic hydrolysis of the antibody or by expression in E. coli or mammalian cells 

5 (e.g. Chinese hamster ovary cell culture or other protein expression systems) of DNA 
encoding the fragment. 

Antibody fragments can be obtained by pepsin or papain digestion of whole 
antibodies by conventional methods. For example, antibody fragments can be 
produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment 

10 denoted F(ab02. This fragment can be further cleaved using a thiol reducing agent, and 
optionally a blocking group for the sulfhydryl groups resulting from cleavage of 
disulfide linkages, to produce 3.5S Fab 1 monovalent fragments. Alternatively, an 
enzymatic cleavage using pepsin produces two monovalent Fab' fragments and an Fc 
fragment directly. These methods are described, for example, by Goldenberg, in U.S. 

15 Pat. Nos. 4,036,945 and 4,331,647, and references contained therein, which patents are 
hereby incorporated by reference in their entirety (see also Porter, R. R., Biochem. J., 
73: 119-126, 1959). Other methods of cleaving antibodies, such as separation of heavy 
chains to form monovalent light-heavy chain fragments, further cleavage of fragments, 
or other enzymatic, chemical, or genetic techniques may also be used, so long as the 

20 fragments bind to the antigen that is recognized by the intact antibody. 

Fv fragments comprise an association of Vh and Vl chains. This association 
may be noncovalent, as described in Inbar et al. (Proc. Natl Acad. Sci. USA 69:2659- 
62, 1972). Alternatively, the variable chains can be linked by an intermolecular 
disulfide bond or cross-linked by chemicals such as glutaraldehyde. Preferably, the Fv 

25 fragments comprise Vh and Vl chains connected by a peptide linker. These single- 
chain antigen binding proteins (sFv) are prepared by constructing a structural gene 
comprising DNA sequences encoding the Vh and V L domains connected by an 
oligonucleotide. The structural gene is inserted into an expression vector, which is 
subsequently introduced into a host cell such as E. coli. The recombinant host cells 

30 synthesize a single polypeptide chain with a linker peptide bridging the two V 
domains. Methods for producing sFvs are described, for example, by Whitlow and 
Filpula, Methods, 2: 97-105, 1991; Bird et al., Science 242:423-426, 1988; Pack et al., 
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Bio/Technology 11:1271-77, 1993; and Ladner et al., U.S. Pat. No. 4,946,778, all of 
which are hereby incorporated, by reference, in entirety. 

Another form of an antibody fragment is a peptide coding for a single 
complementarity-determining region (CDR). CDR peptides ("minimal recognition 
5 units") can be obtained by constructing genes encoding the CDR of an antibody of 
interest. Such genes are prepared, for example, by using the polymerase chain reaction 
to synthesize the variable region from RNA of antibody-producing cells (see, for 
example, Larrick and Fry Methods, 2: 106-10, 1991). 

Humanized forms of non-human (e.g., murine) antibodies are chimeric 
10 molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as 
Fv, Fab, Fab', F(ab')a or other antigen-binding subsequences of antibodies) which 
contain minimal sequence derived from non-human immunoglobulin. Humanized 
antibodies include human immunoglobulins (recipient antibody) in which residues 
form a complementary determining region (CDR) of the recipient are replaced by 
15 residues from a CDR of a non-human species (donor antibody) such as mouse, rat or 
rabbit having the desired specificity, affinity and capacity. In some instances, Fv 
framework residues of the human immunoglobulin are replaced by corresponding non- 
human residues. Humanized antibodies may also comprise residues, which are found 
neither in the recipient antibody nor in the imported CDR or framework sequences, ha 
20 general, the humanized antibody will comprise substantially all of at least one, and 
typically two, variable domains, in which all or substantially all of the CDR regions 
correspond to those of a non-human immunoglobulin and all or substantially all of the 
FR regions are those of a human immunoglobulin consensus sequence. The humanized 
antibody optimally also will comprise at least a portion of an immunoglobulin constant 
25 region (Fc), typically that of a human immunoglobulin [Jones et al., Nature, 321 :522- 
525 (1986); Riechmann et al., Nature, 332:323-329 (1988); and Presta, Curr. Op. 
Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into 
30 it from a source, which is non-human. These non-human amino acid residues are often 
referred to as import residues, which are typically taken from an import variable 
domain. Humanization can be essentially performed following the method of Winter 
and co-workers [Jones et al., Nature, 321:522-525 (1986); Riechmann et al., Nature 
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332:323-327 (1988); Verhoeyen et al., Science, 239:1534-1536 (1988)], by 
substituting rodent CDRs or CDR sequences for the corresponding sequences of a 
human antibody. Accordingly, such humanized antibodies are chimeric antibodies 
(U.S. Pat. No.. 4,816,567), wherein substantially less than an intact human variable 
5 domain has been substituted by the corresponding sequence from a non-human 
species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 

10 art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al., J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boerner et al. are also available for the preparation of human monoclonal antibodies 
(Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) 
and Boerner et al., J. Immunol., 147(l):86-95 (1991)]. Similarly, human monoclonal 

15 antibodies can be made by introducing human immunoglobulin loci into transgenic 
animals, e.g., mice in which the endogenous immunoglobulin genes have been 
partially or completely inactivated. Upon challenge, human antibody production is 
observed, which closely resembles that seen in humans in all respects, including gene 
rearrangement, assembly, and antibody repertoire. This approach is described, for 

20 example, in U.S. Pat. Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 
5,661,016, and in the following scientific publications: Marks et al., Bio/Technology 
10, 779-783 (1992); Lonberg et al., Nature 368 856-859 (1994); Morrison, Nature 368 
812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 (1996); Neuberger, 
Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. Immunol. 13 

25 65-93 (1995). 

Once the level of SIM2 is determined and the subject diagnosed, the diagnosis 
can be further validated using other diagnostic methods, which may also provide an 
accurate staging of the disease in the case of a positive diagnosis. Thus, for example, 
positive diagnosis for ovarian cancer may be confirmed by 
30 transabdominal/transvaginal ultrasonography and/or a blood test for the tumor marker 
CA-125. Ultrasound screening involves looking for enlarged ovaries and a 
transvaginal colour Doppler ultrasound is used to image blood flow. Blood vessel 
formation is thought to discriminate between cancer and benign cysts. Alternatively, 
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positive diagnosis of lung cancer using the method of the present invention can be 
validated using chest X-rays and sputum cytology as well as spiral computed 
tomography (CT) scanning which can detect even very small tumors. Positive 
diagnosis of breast cancer using the method of the present invention can be validated 
5 using scintimammography, mammography, ultrasound and/or magnetic resonance 
imaging (MRI). 

It will be appreciated that the above-described method of this aspect of the 
present invention may also be used to monitor disease progression and therapeutic 
regimen. 

10 In addition to diagnostic advances pioneered by the present invention, the 

identification of overexpression of SIM2 in ovarian and lung cancers allows for the 
design of therapeutic agents, which can be used to treat ovarian, breast and lung 
cancers. 

Thus, according to another aspect of the present invention there is provided a 
15 method of treating a subject (i.e., mammal e.g., human) having ovarian cancer, breast 
and/or lung cancer. 

As used herein the term 'treating" refers to preventing, curing, reversing, 
attenuating, alleviating, minimizing, suppressing or halting the deleterious effects of 
ovarian cancer and/or lung cancer. 

20 The method according to this aspect of the present invention is effected by 

specifically downregulating expression or activity of SIM2 in a lung tissue, breast 
tissue and/or ovarian tissue to thereby treat the ovarian cancer, breast cancer and/or 
lung cancer in the subject. 

Preferably, the method is effected by providing to the subject a therapeutically 

25 effective amount of an agent which is capable of downregulating SIM2 expression 
and/or activity. 

As used herein "an agent capable of downregulating SIM2 expression and/or 
activity" refers to a molecule, which is capable of directly or indirectly downregulating 
S1M2 expression or activity. An agent for direct downregulation of S1M2 refers to a 
30 molecule, which inhibits SIM2 intrinsic activity or expression. An agent for indirect 
downregulation of SIM2 refers to a molecule which inhibits the activity of a SIM2 
effector (e.g., ARNT and HIF-la) or expression thereof. The agents according to this 
aspect of the present invention can be a molecule which binds SIM2 (e.g., an 
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antibody); an enzyme which cleaves SIM2; an antisense polynucleotide enable of 
specifically hybridizing with an mRNA transcript encoding SIM2; a SIM2 specific 
aptamer 3 a ribozyme which specifically cleaves SIM2 transcripts; a non-functional 
analogue of at least a catalytic or binding portion of SIM2 (e.g., a peptide-agent which 

5 can be identified using a phage display technology); a molecule which prevents SIM2 
activation or substrate binding; an siKNA molecule capable of inducing degradation of 
SIM2 transcripts; a DNAzyme which specifically cleaves SIM2 transcripts or DNA, 
an activated double-stranded RNA (dsRNA-dependent protein kinase PKR as 
described in [Shir and Levitzki (2002) Nat. Biotechnol. 20(9):895-900 and Cell Mol 

10 Neurobiol. (2001) 21(6):645-56] and a molecule which is capable of promoting S1M2 
specific immunization response. 

One example, of an agent capable of downregulating a SIM2 is an antibody or 
antibody fragment capable of specifically binding SIM2 or an effector thereof. 
Examples of SIM2 antibodies are described hereinabove. Preferably, such antibodies 

15 are directed at functional domains of a target protein. Thus, for example, a SIM2 
antibody according to this aspect of the present invention is preferably directed at the 
effector binding domain such as the ARNT binding domain. Alternatively, the 
antibody may bind a SIM2 effector such as ARNT or HIF-la. An anti ARNT 
polyclonal rabbit serum raised against residues 1-140 of human ARNT and an anti 

20 HIF-la polyclonal serum raised against residues 786-826 of human HIF-la have been 
described by Susan (2002) J. Biol. Chem. 277:10236-10243. Preferably, the 
antibodies, according to this aspect of the present invention, are humanized such as 
described hereinabove. 

Alternatively an agent capable of downregulating a SIM2 or an effector thereof 

25 can be a protease, which is designed to cleave SIM2. Proteases which can be used to 
cleave SIM2 can be identified by performing a computational analysis such as by 
using the SMART, MEME, MOTIFS, CDD-NCBI, BLOCKS or mPredict software 
each available from http://molbio.info.nih.gov/talks/tools/jobs.html and identifying a 
protease cleavage site. 

30 Alternatively, agents which are designed to inhibit functional domains in the 

SIM2 protein (i.e., protein-protein interaction domains) can be computationally 
identified. For example, various peptide sequences derived from SIM2 can be 
computationally analyzed for an ability to bind an inhibitor using a variety of three 
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dimensional computational tools. Software programs useful for displaying three- 
dimensional structural models, such as RIBBONS (Carson, M., 1997. Methods in 
Enzymology 277, 25), O (Jones, TA. et al y 1991. Acta Crystallogr. A47, 110), DINO 
(DINO: Visualizing Structural Biology (2001) http://www.dino3d.org); and 

5 QUANTA, INSIGHT, SYBYL, MACROMODE, ICM, MOLMOL, RASMOL and 
GRASP (reviewed in Kraulis, J., 1991. Appl Crystallogr. 24, 946) can be utilized to 
model interactions between SIM2 and prospective peptide and/or other small 
molecule inhibitors. Computational modeling of protein-peptide interactions has been 
successfully used in rational drug design, for further detail, see Lam et al., 1994. 

10 Science 263, 380; Wlodawer et al., 1993. Ann Rev Biochem. 62, 543; Appelt, 1993. 
Perspectives in Drug Discovery and Design 1, 23; Erickson, 1993. Perspectives in 
Drug Discovery and Design 1, 109, and Mauro MJ. et al, 2002. J Clin Oncol. 20, 
325-34. Specifically, the PRO_SELECT, tool for the virtual screening of libraries for 
fit to a protein active site, has been used to find novel leads against the serine protease 

15 factor Xa [Liebeschuetz J Med Chem. (2002) ;45(6): 1221-32]. 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
small interfering RNA (siRNA) molecule. RNA interference is a two-step process, 
the first step, which is termed as the initiation step, input dsRNA is digested into 21-23 
nucleotide (nt) small interfering RNAs (siRNA), probably by the action of Dicer, a 

20 member of the RNase HI family of dsRNA-specific ribonucleases, which processes 
(cleaves) dsRNA (introduced directly or via a transgene or a virus) in an ATP- 
dependent manner. Successive cleavage events degrade the RNA to 19-21 bp 
duplexes (siRNA), each with 2-nucleotide 3* overhangs [Hutvagner and Zamore Curr. 
Opin. Genetics and Development 12:225-232 (2002); and Bernstein Nature 409:363- 

25 366 (2001)]. 

In the effector step, the siRNA duplexes bind to a nuclease complex to from 
the RNA-induced silencing complex (RISC). An ATP-dependent unwinding of the 
siRNA duplex is required for activation of the RISC. The active RISC then targets the 
homologous transcript by base pairing interactions and cleaves the mRNA into 12 
30 nucleotide fragments from the V terminus of the siRNA [Hutvagner and Zamore Curr. 
Opin. Genetics and Development 12:225-232 (2002); Hammond et al. (2001) Nat. 
Rev. Gen. 2:110-119 (2001); and Sharp Genes. Dev. 15:485-90 (2001)]. Although the 
mechanism of cleavage is still to be elucidated, research indicates that each RISC 
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contains a single siRNA and an RNase [Hutvagner and Zamore Curr. Opin. Genetics 

and Development 12:225-232 (2002)]. 

Because of the remarkable potency of RNAi, an amplification step within the 

RNAi pathway has been suggested. Amplification could occur by copying of the input 
5 dsRNAs which would generate more siRNAs, or by replication of the siRNAs formed. 

Alternatively or additionally, amplification could be effected by multiple turnover 

events of the RISC [Hammond et ah Nat. Rev. Gen. 2:110-119 (2001), Sharp Genes. 

Dev. 15:485-90 (2001); Hutvagner and Zamore Curr. Opin. Genetics and 

Development 12:225-232 (2002)]. For more information on RNAi see the following 
10 reviews Tuschl ChemBiochem. 2:239-245 (2001); Cullen Nat. Immunol. 3:597-599 

(2002); and Brantl Biochem. Biophys. Act. 1575:15-25 (2002). 

Synthesis of RNAi molecules suitable for use with the present invention can be 

effected as follows. First, the SIM2 mRNA sequence of interest (e.g., SEQ ID NO: 7) 

is scanned downstream of the AUG start codon for AA dinucleotide sequences. 
15 Occurrence of each AA and the 3' adjacent 19 nucleotides is recorded as potential 

siRNA target sites. Preferably, siRNA target sites are selected from the open reading 

frame, as untranslated regions (UTRs) are richer in regulatory protein binding sites. 

UTR-binding proteins and/or translation initiation complexes may interfere with 

binding of the siRNA endonuclease complex [Tuschl ChemBiochem. 2:239-245]. It 
20 will be appreciated though, that siRNAs directed at untranslated regions may also be 

effective, as demonstrated for GAPDH wherein siRNA directed at the 5 5 UTR 

mediated about 90 % decrease in cellular GAPDH mRNA and completely abolished 

protein level (www.ambion.com/techhb/tn/9 1/91 2.html) . 

Second, potential target sites are compared to an appropriate genomic database 
25 (e.g., human, mouse, rat etc.) using any sequence alignment software, such as the 

BLAST software available from the NCBI server (www.ncbi.nlm.nih. gov/BLAST/) . 

Putative target sites which exhibit significant homology to other coding sequences are 

filtered out. 

Qualifying target sequences are selected as template for siRNA synthesis. 
30 Preferred sequences are those including low G/C content as these have proven to be 
more effective in mediating gene silencing as compared to those with G/C content 
higher than 55 %. Several target sites are preferably selected along the length of the 
target gene for evaluation. siRNA target sites on SEQ ID NO: 1 are preferably 



WO 2004/012847 




PCT/IL2003/000636 



31 

selected from the 5' end i.e., coordinates 1-304 (see Figure 1) e.g., nucleotide 
coordinates 171-193 of SEQ ID NO: 1. For better evaluation of the selected siRNAs, 
a negative control is preferably used in conjunction. Negative control siRNA 
preferably include the same nucleotide composition as the siRNAs but lack significant 
5 homology to the genome. Thus, a scrambled nucleotide sequence of the siRNA is 
preferably used, provided it does not display any significant homology to any other 
gene. 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
DNA2yme molecule capable of specifically cleaving an mRNA transcript or DNA 

10 sequence of the SIM2. DNAzymes are single-stranded polynucleotides which are 
capable of cleaving both single and double stranded target sequences (Breaker, R.R. 
and Joyce, G. Chemistry and Biology 1995;2:655; Santoro, S.W. & Joyce, G.F. Proc. 
Natl, Acad. Sci. USA 1997;943:4262) A general model (the "10-23" model) for the 
DNAzyme has been proposed. "10-23" DNAzymes have a catalytic domain of 15 

15 deoxyribonucleotides (e.g., nucleotide coordinates 3 1 1-326 of SEQ ID NO: 1), flanked 
by two substrate-recognition domains of seven to nine deoxyribonucleotides each. 
This type of DNAzyme can effectively cleave its substrate RNA at purinerpyrimidine 
junctions (Santoro, S.W. & Joyce, G.F. Proc. Natl, Acad. Sci. USA 199; for rev of 
DNAzymes see Khachigian, LM [Curr Opin Mol Ther 4:1 19-21 (2002)]. 

20 Examples of construction and amplification of synthetic, engineered 

DNAzymes recognizing single and double-stranded target cleavage sites have been 
disclosed in U.S. Pat. No. 6,326,174 to Joyce et ah DNAzymes of similar design 
directed against the human Urokinase receptor were recently observed to inhibit 
Urokinase receptor expression, and successfully inhibit colon cancer cell metastasis in 

25 vivo (Itoh et al , 20002, Abstract 409, Ann Meeting Am Soc Gen Ther 
www.asgt.org\ In another application, DNAzymes complementary to bcr-abl 
oncogenes were successful in inhibiting the oncogenes expression in leukemia cells, 
and lessening relapse rates in autologous bone marrow transplant in cases of CML and 
ALL. 

30 Downregulation of a SIMS or an effector thereof can also be effected by using 

an antisense polynucleotide capable of specifically hybridizing with an mRNA 
transcript encoding the SIM2 transcripts. 



WO 2004/012847 




PCT/IL2003/000636 



32 

Design of antisense molecules, which can be used to efficiently downregulate a 
SIM2 must be effected while considering two aspects important to the antisense 
approach. The first aspect is delivery of the oligonucleotide into the cytoplasm of the 
appropriate cells, while the second aspect is design of an oligonucleotide which 

5 specifically binds the designated mRNA within cells in a way which inhibits 
translation thereof. An example antisense oligonucleotide which can be used in 
accordance with the present invention is designed to hybridize to nucleotide 
coordinates 311-400 of SEQ ID NO: 2. Such an antisense molecule hybridizes to a 
sequence which is shared by all SIM2 expression products known to date and as such 

10 may be useful in silencing SIM2 expression. 

The prior art teaches of a number of delivery strategies which can be used to 
efficiently deliver oligonucleotides into a wide variety of cell types [see, for example, 
Luft J Mol Med 76: 75-6 (1998); Kronenwett et al Blood 91: 852-62 (1998); Rajur et 
al Bioconjug Chem 8: 935-40 (1997); Lavigne et al. Biochem Biophys Res Commun 

15 237: 566-71 (1997); and Aoki et al (1997) Biochem Biophys Res Commun 231: 540- 
5 (1997)]. 

In addition, algorithms for identifying those sequences with the highest 
predicted binding affinity for their target mRNA based on a thermodynamic cycle that 
accounts for the energetics of structural alterations in both the target mRNA and the 
20 oligonucleotide are also available [see, for example, Walton et al. Biotechnol Bioeng 
65: 1-9(1999)]. 

Such algorithms have been successfully used to implement an antisense 
approach in cells. For example, the algorithm developed by Walton et al. enabled 
scientists to successfully design antisense oligonucleotides for rabbit beta-globin 

25 (RBG) and mouse tumor necrosis factor-alpha (TNF alpha) transcripts. The same 
research group has more recently reported that the antisense activity of rationally 
selected oligonucleotides against three model target mRNAs (human lactate 
dehydrogenase A and B and rat gpl30) in cell culture as evaluated by a kinetic PCR 
technique proved effective in almost all cases, including tests against three different 

30 targets in two cell types with phosphodiester and phosphorothioate oligonucleotide 
chemistries. 
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In addition, several approaches for designing and predicting efficiency of 
specific oligonucleotides using an in vitro system were also published (Matveeva et 
aL, Nature Biotechnology 16: 1374 - 1375 (1998)]. 

Several clinical trials have demonstrated safety, feasibility and activity of 

5 antisense oligonucleotides. For example, antisense oligonucleotides suitable for the 
treatment of cancer have been successfully used [Holmund et aL, Curr Opin Mol Ther 
1:372-85 (1999)], while treatment of hematological malignancies via antisense 
oligonucleotides targeting c-myb gene, p53 and Bcl-2 had entered clinical trials and 
had been shown to be tolerated by patients [Gerwitz Curr Opin Mol Ther 1:297-306 

10 (1999)]. 

More recently, antisense-mediated suppression of human heparanase gene 
expression has been reported to inhibit pleural dissemination of human cancer cells in 
a mouse model [Uno et aL, Cancer Res 61:7855-60 (2001)]. 

Thus, the current consensus is that recent developments in the field of antisense 
15 technology which, as described above, have led to the generation of highly accurate 
antisense design algorithms and a wide variety of oligonucleotide delivery systems, 
enable an ordinarily skilled artisan to design and implement antisense approaches 
suitable for downregulating expression of known sequences without having to resort to 
undue trial and error experimentation. 
20 It will be appreciated that antisense oligonucleotides can be used to modulate 

alternative splicing from SIM2 gene [Suzani and Kole (2003) Progress in Molecular 
and Subcellular Biology vol. 31 Philippe Jeanteur (Ed.) Springer-Verlag Berlin 
Heidelberg]. Inhibition of splicing by antisense oligonucleotides can be accomplished 
by targeting oligonucleotides to small nuclear RNAs (snRNAs), which participate in 
25 spliceosome formation and are essential for splicing and to splice sites and adjacent 
sequences [reviewed in Kole (1991) Adv. Drug Delivery Rev. 6:271-286]. 

Another agent capable of downregulating a SIM2 or an effector thereof is a 
ribozyme molecule capable of specifically cleaving an mRNA transcript encoding a 
SIM2 for example. Ribozymes are being increasingly used for the sequence-specific 
30 inhibition of gene expression by the cleavage of mRNAs encoding proteins of interest 
[Welch et aL, Curr Opin Biotechnol. 9:486-96 (1998)]. The possibility of designing 
ribozymes to cleave any specific target RNA has rendered them valuable tools in both 
basic research and therapeutic applications. In the therapeutics area, ribozymes have 
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been exploited to target viral RNAs in infectious diseases, dominant oncogenes in 
cancers and specific somatic mutations in genetic disorders [Welch et aL, Clin Diagn 
Virol. 10:163-71 (1998)]. Most notably, several ribozyme gene therapy protocols for 
HIV patients are already in Phase 1 trials. More recently, ribozymes have been used 
for transgenic animal research, gene target validation and pathway elucidation. 
Several ribozymes are in various stages of clinical trials. ANGIOZYME was the first 
chemically synthesized ribozyme to be studied in human clinical trials. 
ANGIOZYME specifically inhibits formation of the VEGF-r (Vascular Endothelial 
Growth Factor receptor), a key component in the angiogenesis pathway. Ribozyme 
Pharmaceuticals, Inc., as well as other firms have demonstrated the importance of anti- 
angiogenesis therapeutics in animal models. HEPTAZYME, a ribozyme designed to 
selectively destroy Hepatitis C Virus (HCV) RNA, was found effective in decreasing 
Hepatitis C viral RNA in cell culture assays (Ribozyme Pharmaceuticals, Incorporated 
- WEB home page). 

Alternatively, the agent can be a molecule, which promotes a SIM2-specific 
immunogenic response in the subject. The molecule can be a SIM2 protein, a 
fragment derived therefrom or a nucleic acid sequence encoding thereof. Although 
such a molecule can be provided to the subject per se, the agent is preferably 
administered with an immunostimulant in an immunogenic composiiton. An 
immunostimulant may be any substance that enhances or potentiates an immune 
response (antibody and/or cell-mediated) to an exogenous antigen. Examples of 
immunostimulants include adjuvants, biodegradable microspheres (e.g., polylactic 
galactide) and liposomes into which the compound is incorporated (see e.g., U.S. Pat. 
No. 4,235,877). Vaccine preparation is generally described in, for example, M. F. 
Powell and M. J. Newman, eds., "Vaccine Design (the subunit and adjuvant 
approach)," Plenum Press (NY, 1995). 

Illustrative immunogenic compositions may contain DNA encoding one or 
more of the SIM2 polypeptides as described above, such that the polypeptide is 
generated in situ. The DNA may be present within any of a variety of delivery 
systems known to those of ordinary skill in the art, including nucleic acid expression 
systems (see below), bacteria and viral expression systems. Numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 15:143-198, 1998, and references cited therein. 
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Appropriate nucleic acid expression systems contain the necessary DNA sequences 
for expression in the subject (such as a suitable promoter and terminating signal). 
Bacterial delivery systems involve the administration of a bacterium (such as 
BaciUus-Calmette-Guerrin) that expresses an immunogenic portion of the polypeptide 
5 on its cell surface or secretes such an epitope. In a preferred embodiment, the DNA 
may be introduced using a viral expression system (e.g., vaccinia or other pox virus, 
retrovirus, or adenovirus), which may involve the use of a non-pathogenic (defective), 
replication competent virus. Suitable systems are disclosed, for example, in Fisher- 
Hoch et al., Proc. Natl. Acad. Sci. USA 86:317-321, 1989; Flexner et al., Ann. N.Y 

10 Acad. Sci. 569:86-103, 1989; Flexner et al., Vaccine 8:17-21, 1990; U.S. Pat. Nos. 
4,603,112, 4,769,330, and 5,017,487; WO 89/01973; U.S. Pat. No. 4,777,127; GB 
2,200,651; EP 0,345,242; WO 91/02805; Berkner, Biotechniques 6:616-627, 1988; 
Rosenfeld et al., Science 252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA 
91:215-219, 1994; Kass-Eisler et al., Proc. Natl. Acad. Sci. USA 90:11498-11502, 

15 1993; Guzman et al., Circulation 88:2838-2848, 1993; and Guzman et al., Cir. Res. 
73:1202-1207, 1993. Techniques for incorporating DNA into such expression systems 
are well known to those of ordinary skill in the art. The DNA may also be "naked," as 
described, for example, in Ulmer et al., Science 259:1745-1749, 1993 and reviewed 
by Cohen, Science 259:1691-1692, 1993. The uptake of naked DNA may be 

20 increased by coating the DNA onto biodegradable beads, which are efficiently 
transported into the cells. 

It will be appreciated that an immunogenic composition may comprise both a 
polynucleotide and a polypeptide component. Such immunogenic compositions may 
provide for an enhanced immune response. 

25 Any of a variety of immunostimulants may be employed in the immunogenic 

compositions of this invention. For example, an adjuvant may be included. Most 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, 
such as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 

30 Suitable adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.); Merck 
Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.); AS-2 (SmithKline Beecham, * 
Philadelphia, Pa.); aluminum salts such as aluminum hydroxide gel (alum) or 
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aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 
acylated tyrosine; acylated sugars; cationically or anionically derivatized 
polysaccharides; polyphosphazenes; biodegradable microspheres; monophosphoryl 
lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2,-7, or -12, may also 

5 be used as adjuvants. 

The adjuvant composition may be designed to induce an immune response 
predominantly of the Thl type. High levels of Thl-type cytokines (e.g., IFN-. gamma., 
TNF.alpha., EL-2 and IL-12) tend to favor the induction of cell mediated immune 
responses to an administered antigen. In contrast, high levels of Th2-type cytokines 

10 (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the induction of humoral immune 
responses. Following application of an immunogenic composition as provided herein, 
the subject will support an immune response that includes Thl- and Th2-type 
responses. The levels of these cytokines may be readily assessed using standard 
assays. For a review of the families of cytokines, see Mosmann and Coffinan, Ann. 

15 Rev. Immunol. 7:145-173, 1989. 

Preferred adjuvants for use in eliciting a predominantly Thl-type response 
include, for example, a combination of monophosphoryl lipid A, preferably 3-de-O- 
acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. MPL 
adjuvants are available from Corixa Corporation (Seattle, Wash.; see U.S. Pat. Nos. 

20 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides 
(in which the CpG dinucleotide is unmethylated) also induce a predominantly Thl 
response. Such oligonucleotides are well known and are described, for example, in 
WO 96/02555, WO 99/33488 and U.S. Pat. Nos. 6,008,200 and 5,856,462. 
Immunostimulatory DNA sequences are also described, for example, by Sato et al., 

25 Science 273:352, 1996. Another preferred adjuvant is a saponin, preferably QS21 
(Aquila Biopharmaceuticals Inc., Framingham, Mass.), which may be used alone or in 
combination with other adjuvants. For example, an enhanced system involves the 
combination of a monophosphoryl lipid A and saponin derivative, such as the 
combination of QS21 and 3D-MPL as described in WO 94/00153, or a less 

30 reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. A particularly potent adjuvant formulation involving QS21, 3D-MPL and 
tocopherol in an oil-in-water emulsion is described in WO 95/17210. 
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Other preferred adjuvants include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, Calif., United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series of 
adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Corixa, Hamilton, Mont.), RC-529 (Corixa, Hamilton, Mont.) and 

5 other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in 
pending U.S. patent application Ser. Nos. 08/853,826 and 09/074,720. 

A delivery vehicle may be employed within the immunogenic composition of 
the present invention to facilitate production of an antigen-specific immune response 
that targets tumor cells. Delivery vehicles include antigen presenting cells (APCs), 

10 such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may be genetically modified to increase 
the capacity for presenting the antigen, to improve activation and/or maintenance of 
the T cell response, to have anti-tumor effects per se and/or to be immunologically 
compatible with the receiver (i.e., matched HLA haplotype). APCs may generally be 

15 isolated from any of a variety of biological fluids and organs, including tumor and 
peritumoral tissues, and may be autologous, allogeneic, syngeneic or xenogeneic 
cells. 

Dendritic cells are highly potent APCs (Banchereau and Steinman, Nature 
392:245-251, 1998) and have been shown to be effective as a physiological adjuvant 

20 for eliciting prophylactic or therapeutic antitumor immunity (see Timmernan and 
Levy, Ann. Rev. Med. 50:507-529, 1999). In general, dendritic cells maybe identified 
based on their typical shape (stellate in situ, with marked cytoplasmic processes 
(dendrites) visible in vitro), their ability to take up, process and present antigens with 
high efficiency and their ability to activate naive T cell responses. Dendritic cells 

25 may, of course, be engineered to express specific cell-surface receptors or ligands that 
are not commonly found on dendritic cells in vivo or ex vivo, and such modified 
dendritic cells are contemplated by the present invention. As an alternative to 
dendritic cells, secreted vesicles antigen-loaded dendritic cells (called exosomes) may 
be used within an immunogenic composition (see Zitvogel et al., Nature Med. 4:594- 

30 600, 1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, bone 
marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph nodes, 
spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For example, 
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dendritic cells may be differentiated ex vivo by adding a combination of cytokines 
such as GM-CSF, IL-4, IL-13 and/or TNF.alpha. to cultures of monocytes harvested 
from peripheral blood. Alternatively, CD34 positive cells harvested from peripheral 
blood, umbilical cord blood or bone marrow may be differentiated into dendritic cells 
5 by adding to the culture medium combinations of GM-CSF, IL-3, TNF.alpha., CD40 
ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are categorized as "immature" and "mature" cells, which allows 
a simple way to discriminate between two well characterized phenotypes. Immature 
10 dendritic cells are characterized as APC with a high capacity for antigen uptake and 
processing, which correlates with the high expression of Fey receptor and mannose 
receptor. The mature phenotype is typically characterized by a lower expression of 
these markers, but a high expression of cell surface molecules responsible for T cell 
activation such as class I and class H MHC, adhesion molecules (e.g., CD54 and 
15 CD1 1) and costimulatory molecules (e.g., CD40, CD80, CD86 and 4-1BB). 

APCs may generally be transfected with a polynucleotide encoding SIM2, 
such that SIM2, or an immunogenic portion thereof, is expressed on the cell surface. 
Such transfection may take place ex vivo, and a composition comprising such 
transfected cells may then be used for therapeutic purposes, as described herein. 
20 Alternatively, a gene delivery vehicle that targets a dendritic or other antigen 
presenting cell may be administered to the subject, resulting in transfection that 
occurs in vivo. In vivo and ex vivo transfection of dendritic cells, for example, may 
generally be performed using any methods known in the art, such as those described 
in WO 97/24447, or the gene gun approach described by Mahvi et al., Immunology 
25 and cell Biology 75:456-460, 1997. Antigen loading of dendritic cells may be 
achieved by incubating dendritic cells or progenitor cells with the SIM2 polypeptide, 
DNA (naked or within a plasmid vector) or RNA; or with antigen-expressing 
recombinant bacterium or viruses (e.g., vaccinia, fowlpox, adenovirus or lentivirus 
vectors). Prior to loading, the polypeptide may be covalently conjugated to an 
30 immunological partner that provides T cell help (e.g., a carrier molecule) such as 
described above. Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 
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Agents for downregulating expression or activity of SIM2 (i.e., active 
ingredients) of the present invention can be provided to the subject per se, or as part of 
a pharmaceutical composition where they are mixed with a pharmaceuticaUy 
acceptable carrier. 

5 As used herein a "pharmaceutical composition" refers to a preparation of one 

or more of the active ingredients described herein with other chemical components 
such as physiologically suitable carriers and excipients. The purpose of a 
pharmaceutical composition is to facilitate administration of a compound to an 
organism. 

10 Herein the term "active ingredient" refers to the preparation accountable for the 

biological effect. 

Hereinafter, the phrases "physiologically acceptable carrier" and 
"pharmaceuticaUy acceptable carrier" which may be interchangeably used refer to a 
carrier or a diluent that does not cause significant irritation to an organism and does 

15 not abrogate the biological activity and properties of the administered compound. An 
adjuvant is included under these phrases. One of the ingredients included in the 
pharmaceuticaUy acceptable carrier can be for example polyethylene glycol (PEG), a 
biocompatible polymer with a wide range of solubility in both organic and aqueous 
media (Mutter et al. (1979). 

20 Herein the term "excipient" refers to an inert substance added to a 

pharmaceutical composition to further facilitate administration of an active ingredient. 
Examples, without limitation, of excipients include calcium carbonate, calcium 
phosphate, various sugars and types of starch, cellulose derivatives, gelatin, vegetable 
oils and polyethylene glycols. 

25 Techniques for formulation and administration of drugs may be found in 

"Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition, which is incorporated herein by reference. 

Suitable routes of administration may, for example, include oral, rectal, 
transmucosal, especially transnasal, intestinal or parenteral delivery, including 

30 intramuscular, subcutaneous and intramedullary injections as well as intrathecal, direct 
intraventricular, intravenous, inrtaperitoneal, intranasal, or intraocular injections. 
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Alternately, one may administer a preparation in a local rather than systemic 
manner, for example, via injection of the preparation directly into a specific region of 
a patient's body. 

Pharmaceutical compositions of the present invention may be manufactured by 
processes well known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 

lyophilizing processes. 

Pharmaceutical compositions for use in accordance with the present invention 
may be formulated in conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries, which facilitate processing 
of the active ingredients into preparations which, can be used pharmaceutically. 
Proper formulation is dependent upon the route of administration chosen. 

For injection, the active ingredients of the invention may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hank's 
solution, Ringer's solution, or physiological salt buffer. For transmucosal 
adniinistration, penetrants appropriate to the barrier to be permeated are used in the 
formulation. Such penetrants are generally known in the art. 

For oral administration, the compounds can be formulated readily by 
combining the active compounds with pharmaceutically acceptable carriers well 
known in the art. Such carriers enable the compounds of the invention to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for oral ingestion by a patient. Pharmacological 
preparations for oral use can be made using a solid excipient, optionally grinding the 
resulting mixture, and processing the mixture of granules, after adding suitable 
auxiliaries if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; 
cellulose preparations such as, for example, maize starch, wheat starch, rice starch, 
potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl- 
cellulose, sodium carbomethylcellulose; and/or physiologically acceptable polymers 
such as polyvinylpyrrohdone (PVP). If desired, disintegrating agents may be added, 
such as cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such 
as sodium alginate. 
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Dragee cores are provided with suitable coatings. For this purpose, 
concentrated sugar solutions may be used which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, titanium dioxide, 
lacquer solutions and suitable organic solvents or solvent mixtures. Dyestuffs or 

5 pigments may be added to the tablets or dragee coatings for identification or to 
characterize different combinations of active compound doses. 

Pharmaceutical compositions, which can be used orally, include push-fit 
capsules made of gelatin as well as soft, sealed capsules made of gelatin and a 
plasticizer, such as glycerol or sorbitol. The push-fit capsules may contain the active 

10 ingredients in admixture with filler such as lactose, binders such as starches, lubricants 
such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the 
active ingredients may be dissolved or suspended in suitable liquids, such as fatty oils, 
liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. 
All formulations for oral administration should be in dosages suitable for the chosen 

15 route of administration. 

For buccal administration, the compositions may take the form of tablets or 
lozenges formulated in conventional maimer. 

For administration by nasal inhalation, the active ingredients for use according 
to the present invention are conveniently delivered in the form of an aerosol spray 

20 presentation from a pressurized pack or a nebulizer with the use of a suitable 
propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichloro- 
tetrafluoroethane or carbon dioxide. In the case of a pressurized aerosol, the dosage 
unit may be determined by providing a valve to deliver a metered amount. Capsules 
and cartridges of, e.g., gelatin for use in a dispenser may be formulated containing a 

25 powder mix of the compound and a suitable powder base such as lactose or starch. 

The preparations described herein may be formulated for parenteral 
administration, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampoules or in multidose 
containers with optionally, an added preservative. The compositions may be 

30 suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain 
fonnulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical compositions for parenteral administration include aqueous 
solutions of the active preparation in water-soluble form. Additionally, suspensions of 
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the active ingredients may be prepared as appropriate oily or water based injection 
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame 
oil, or synthetic fatty acids esters such as ethyl oleate, triglycerides or liposomes. 
Aqueous injection suspensions may contain substances, which increase the viscosity of 

5 the suspension, such as sodium carboxymethyl cellulose, sorbitol or dextran. 
Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the active ingredients to allow for the preparation of highly 
concentrated solutions. 

Alternatively, the active ingredient may be in powder form for constitution 

10 with a suitable vehicle, e.g., sterile, pyrogen-free water based solution, before use. 

The preparation of the present invention may also be formulated in rectal 
compositions such as suppositories or retention enemas, using, e.g., conventional 
suppository bases such as cocoa butter or other glycerides. 

Pharmaceutical compositions suitable for use in context of the present 

15 invention include compositions wherein the active ingredients are contained in an 
amount effective to achieve the intended purpose. More specifically, a therapeutically 
effective amount means an amount of active ingredients effective to prevent, alleviate 
or ameliorate symptoms of disease or prolong the survival of the subject being treated. 
Determination of a therapeutically effective amount is well within the 

20 capability of those skilled in the art. 

For any preparation used in the methods of the invention, the therapeutically 
effective amount or dose can be estimated initially from in vitro assays. For example, 
a dose can be formulated in animal models and such information can be used to more 
accurately determine useful doses in humans. 

25 Toxicity and therapeutic efficacy of the active ingredients described herein can 

be determined by standard pharmaceutical procedures in vitro, in cell cultures or 
experimental animals. The data obtained from these in vitro and cell culture assays 
and animal studies can be used in formulating a range of dosage for use in human. 
The dosage may vary depending upon the dosage form employed and the route of 

30 administration utilized. The exact formulation, route of administration and dosage can 
be chosen by the individual physician in view of the patient's condition. (See e.g., 
Fingl, et al., 1975, in "The Pharmacological Basis of Therapeutics", Ch. 1 p.l). 
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Depending on the severity and responsiveness of the condition to be treated, 
dosing can be of a single or a plurality of administrations, with course of treatment 
lasting from several days to several weeks or until cure is effected or diminution of the 
disease state is achieved. 
5 The amount of a composition to be administered will, of course, be dependent 

on the subject being treated, the severity of the affliction, the manner of 
administration, the judgment of the prescribing physician, etc. 

Compositions including the preparation of the present invention formulated in 
a compatible pharmaceutical carrier may also be prepared, placed in an appropriate 
10 container, and labeled for treatment of an indicated condition. 

Pharmaceutical compositions of the present invention may, if desired, be 
presented in a pack or dispenser device, such as an FDA approved kit, which may 
contain one or more unit dosage forms containing the active ingredient. The pack 
may, for example, comprise metal or plastic foil, such as a blister pack. The pack or 
15 dispenser device may be accompanied by instructions for administration. The pack or 
dispenser may also be accommodated by a notice associated with the container in a 
form prescribed by a governmental agency regulating the manufacture, use or sale of 
pharmaceuticals, which notice is reflective of approval by the agency of the form of 
the compositions or human or veterinary administration. Such notice, for example, 
20 may be of labeling approved by the U.S. Food and Drug Administration for 
prescription drugs or of an approved product insert. 

It will be appreciated that the oligonucleotides of the present invention can also 
be expressed from a nucleic acid construct, which can be administered to the subject 
employing any suitable mode of administration, described hereinabove (e.g., in-vivo 
25 gene therapy). Such a nucleic acid construct is introduced into a target cell or cells via 
appropriate gene delivery vehicle/methods (transfection, transduction, homologous 
recombination, etc.) and an expression system as needed and then the modified cells 
are expanded in culture and returned to the subject (i.e., ex-vivo gene therapy). 

Such expression constructs may include a tissue-specific promoter for directing 
30 expression of the downregulating agents in the malignant tissue. Thus an ovarian 
specific promoter such as OSP-1 [Kumaran Cancer Res. (2001) Feb 15;61(4):1291- 
5]and IAI.3B [Hamada Cancer Res. (2003) May 15;63(10):2506-12] may be used. A 
breast-specific promoter which may be used in accordance with the present invention 
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includes rNRL, Muc-1 and mWAP [Hiripi (2003) DNA Cell Biol. 22:41-5; Berger 
(2001) Breast Cancer Res. 3:28-35]. Alternatively a lung specific promoter such as 
CC-10 [Harrod Am J Respir Cell Mol Biol. (2002) Feb;26(2):216-23]and SP-C [Duan 
Oncogene. (2002) Nov 7;21(51):7831-8]may be used 
5 Expression of duplex oligonucleotides is preferably effected via expression 

vectors specifically designed for such use. For example, the pSUPER™ including the 
polymerase-in Hl-RNA gene promoter with a well defined start of transcription and a 
termination signal consisting of five thymidines in a row (T5) [Brummelkamp (2002) 
Science 296:550-53]. Another suitable siRNA expression vector encodes the sense 

10 and antisense siRNA under the regulation of separate poim promoters [Miyagishi and 
Taira[(2002) Nature Biotech. 20:497-500]. The resultant siRNA includes 5 thymidine 
termination signal. Alternatively, oligonucleotide sequences can be placed under bi- 
directional promoters to produce both the sense and antisense transcripts from the 
same promoter construct, thus simplifying the construction of expression vectors and 

15 achieving an equal molar ratio of cellular sense and antisense sequences. Examples 
for bi-directional promoters are disclosed in U.S. Pat. Appl. No. 20020108142. 
It will be appreciated that when duplex oligonucleotide are used, transfection reagents 
dedicated to siRNA transfer to mammalian cells are preferably employed. Examples 
for such include but are not limited to siPORT™ Amine (i.e., a polyamine mixture) 

20 and siPORT™ Lipid (i.e., a mixture of cationic and neutral lipids). 

Accordingly, in cases where the duplex oligonucleotides of the present 
invention are introduced into a cell in which RNA interference (RNAi) does not 
normally occur, the factors needed to mediate RNAi are introduced into such a cell or 
the expression of the needed factors is induced, as disclosed in U.S. Pat. Appl. No.: 

25 20020086356. 

It will be appreciated that treatment of subjects exhibiting mutated SIM2 
transcripts may also be effected using a "knock in" strategy (see U.S. Pat. No. 
6,265,632), wherein endogenous SIM2 sequence alterations are corrected using 
advanced gene therapy. 

30 As is mentioned hereinabove, the present inventors uncovered novel isofonns 

ofSIM-2. 

Thus, according to another aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence encoding a polypeptide 



WO 2004/012847 




PCI7IL2003/000636 



45 

being at least 60 %, at least 65 %, at least 70 %, at least 75 %, at least 80 %, at least 82 
%, at least 86 %, at least 88 %, at least 90 %, at least 92 %, at least 94 % or more, say 
95 % - 100 % homologous to SEQ ID NO: 39, as determined using the BestFit 
software of the Wisconsin sequence analysis package, utilizing the Smith and 
5 Waterman algorithm, where the gap creation equals 8 and gap extension penalty 
equals 2. 

According to one preferred embodiment of this aspect of the present invention 
the isolated polynucleotide is at least 60 %, at least 65 %, at least 70 %, at least 75 %, 
at least 80 %, at least 82 %, at least 86 %, at least 88 %, at least 90 %, at least 92 % at 

10 least 94 % or more, say 95 % - 100 % identical to SEQ ID NO: 39, as determined 
using BestFit software of the Wisconsin sequence analysis package, utilizing the 
Smith and Waterman algorithm, where gap weight equals 50, length weight equals 3, 
average match equals 10 and average mismatch equals -9. 

According to another preferred embodiment of this aspect of the present 

15 invention the isolated polynucleotide is as set forth in SEQ ID NO: 2. 

According to yet another aspect of the present invention there is provided an 
isolated polynucleotide comprising a nucleic acid sequence encoding a polypeptide 
being at least 50 %, at least 5 %, at least 60 %, at least 65 %, at least 70 %, at least 75 
%, at least 80 %, at least 82 %, at least 86 %, at least 88 %, at least 90 %, at least 92 

20 %, at least 94 % or more, say 95 % - 100 % homologous to SEQ ID NOs: 40 or 41, as 
determined using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where the gap creation equals 8 and gap 
extension penalty equals 2. 

According to one preferred embodiment of this aspect of the present invention 

25 the isolated polynucleotide is at least 50 %, at least 5 %, at least 60 %, at least 65 %, at 
least 70 %, at least 75 %, at least 80 %, at least 82 %, at least 86 %, at least 88 %, at 
least 90 %, at least 92 %, at least 94 % or more, say 95 % - 100 % identical to SEQ ID 
NOs: 40 or 41, as determined using BestFit software of the Wisconsin sequence 
analysis package, utilizing the Smith and Waterman algorithm, where gap weight 

30 equals 50, length weight equals 3, average match equals 10 and average mismatch 
equals —9. 

According to another preferred embodiment of this aspect of the present 
invention the isolated polynucleotide is as set forth in SEQ ID NO: 3. 
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As used herein the phrase "an isolated polynucleotide" refers to a single or 
double stranded nucleic acid sequences which is isolated and provided in the form of 
an RNA sequence, a complementary polynucleotide sequence (cDNA), a genomic 
polynucleotide sequence and/or a composite polynucleotide sequences (e.g., a 

5 combination of the above). 

As used herein the phrase "complementary polynucleotide sequence" refers to 
a sequence, which results from reverse transcription of messenger RNA using a 
reverse transcriptase or any other RNA dependent DNA polymerase. Such a sequence 
can be subsequently amplified in vivo or in vitro using a DNA dependent DNA 

10 polymerase. 

As used herein the phrase "genomic polynucleotide sequence" refers to a 
sequence derived (isolated) from a chromosome and thus it represents a contiguous 
portion of a chromosome. 

As used herein the phrase "composite polynucleotide sequence" refers to a 
15 sequence, which is at least partially complementary and at least partially genomic. A 
composite sequence can include some exonal sequences required to encode the 
polypeptide of the present invention, as well as some intronic sequences interposing 
therebetween. The intronic sequences can be of any source, including of other genes, 
and typically will include conserved splicing signal sequences. Such intronic 
20 sequences may further include cis acting expression regulatory elements. 

Since the polynucleotide sequences of the present invention encode previously 
unidentified polypeptides, the present invention also encompasses isolated 
polypeptides or portions thereof which are encoded by the isolated polynucleotide 
which are described hereinabove. 
25 Thus, this aspect of the present invention also encompasses polypeptides which 

are set forth in SEQ ID NO: 39, 40 or 41, homologues thereof (selected from the 
homology range of 60-100 % described hereinabove) fragments thereof and altered 
polypeptides characterized by mutations, such as deletion, insertion or substitution of 
one or more amino acids, either naturally occurring or man induced, either randomly 

30 or in a targeted fashion. 

Since expression of SIM2 is correlatable with cancer development (see WO 
02/12565) the present invention also envisages the use of the novel sequences in 
diagnosis and treatment of cancer. Examples include, but are not limited to bone 
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cancers, brain tumors, breast cancer, endocrine system cancers, gastrointestinal 
cancers, gynecological cancers, head and neck cancers, leukemia, lymphomas, 
metastases, myelomas, pediatric cancers, penile cancer, prostate cancer, sarcomas, skin 
cancers, thyroid cancer, thyoma, urinary tract cancers, carcinoma of unknown primary 

5 and Li-Fraumeni syndrome. 

As is illustrated in the Examples section which follows, the present inventors 
have shown through laborious experimentation that these sequences are differentially 
expressed in colon adenocarcinoma, in lung adenocarcinoma and in lung squamous 
cell carcinoma supporting the use of such sequences in diagnosis and treatment of 

10 cancer as described above. 

Additional objects, advantages, and novel features of the present invention will 
become apparent to one ordinarily skilled in the art upon examination of the following 
examples, which are not intended to be limiting. Additionally, each of the various 
15 embodiments and aspects of the present invention as delineated hereinabove and as 
claimed in the claims section below finds experimental support in the following 
examples. 

EXAMPLES 

20 Reference is now made to the following examples, which together with the 

above descriptions, illustrate the invention in a non limiting fashion. 

Generally, the nomenclature used herein and the laboratory procedures utilized 
in the present invention include molecular, biochemical, microbiological and 
recombinant DNA techniques. Such techniques are thoroughly explained in the 

25 literature. See, for example, "Molecular Cloning: A laboratory Manual" Sambrook et 
al., (1989); "Current Protocols in Molecular Biology" Volumes ME Ausubel, R. M., 
ed. (1994); Ausubel et al., "Current Protocols in Molecular Biology", John Wiley and 
Sons, Baltimore, Maryland (1989); Perbal, "A Practical Guide to Molecular Cloning", 
John Wiley & Sons, New York (1988); Watson et al., "Recombinant DNA", Scientific 

30 American Books, New York; Birren et al. (eds) "Genome Analysis: A Laboratory 
Manual Series", Vols. 1-4, Cold Spring Harbor Laboratory Press, New York (1998); 
methodologies as set forth in U.S. Pat. Nos. 4,666,828; 4,683,202; 4,801,531; 
5,192,659 and 5,272,057; "Cell Biology: A Laboratory Handbook", Volumes I-m 
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Cellis, J. E., ed. (1994); "Current Protocols in Immunology" Volumes I-HI Coligan J. 
E., ed. (1994); Stites et al. (eds), "Basic and Clinical Immunology" (8th Edition), 
Appleton & Lange, Norwalk, CT (1994); Mishell and Shiigi (eds), "Selected Methods 
in Cellular Immunology", W. EL Freeman and Co., New York (1980); available 

5 immunoassays are extensively described in the patent and scientific literature, see, for 
example, U.S. Pat. Nos. 3,791,932; 3,839,153; 3,850,752; 3,850,578; 3,853,987; 
3,867,517; 3,879,262; 3,901,654; 3,935,074; 3,984,533; 3,996,345; 4,034,074; 
4,098,876; 4,879,219; 5,011,771 and 5,281,521; "Oligonucleotide Synthesis" Gait, M. 
J., ed. (1984); "Nucleic Acid Hybridization" Hames, B. D., and Higgins S. J., eds. 

10 (1985); "Transcription and Translation" Hames, B. D., and Higgins S. J., Eds. (1984); 
"Animal Cell Culture" Freshney, R. L, ed. (1986); "Immobilized Cells and Enzymes" 
IRL Press, (1986); "A Practical Guide to Molecular Cloning" Perbal, B., (1984) and 
"Methods in Enzymology" Vol. 1-317, Academic Press; "PCR Protocols: A Guide To 
Methods And Applications", Academic Press, San Diego, CA (1990); Marshak et al., 

15 "Strategies for Protein Purification and Characterization - A Laboratory Course 
Manual" CSHL Press (1996); all of which are incorporated by reference as if fully set 
forth herein. Other general references are provided throughout this document. The 
procedures therein are believed to be well known in the art and are provided for the 
convenience of the reader. All the information contained therein is incorporated 

20 herein by reference. 

EXAMPLE 1 
Genomic organization of SJM2 
Schematic presentation of SIM2 lung specific transcripts is shown in Figure 1. 
25 Specifically, the genomic alignment of the exons of SEQ ID NOs: 1-3, 7 and 8 is 
shown. The genomic sequence vised as a reference is chromosome 21q22.13, starting 
at position 36992386 and terminating at position 37042613. Genomic sequences are 
available at UCSC Genome Bioinformatics database, release version April 2002 
(http://genome.ucsc.edu) . 
30 Table 1, below shows the coordinates of the exons on the genomic sequence 

for each SEQ ID NO, correspondingly. 
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Table 1 



SEQWNO: 


Exon coordinates on 
expressed sequence 


Relative exon coordinates on 
genomic sequence 


1 


1-1001 


12073-13073 


2 


1-312 


12073-12384 


2 


313^*04 


13843-13932 


2 


405-513 


21132-21240 


2 


514-599 


24356-24441 


2 


600-621 


27430-27451 


3 


1-517 


1001-1517 


3 


518-1766 


1930-3178 


3 


1767-3143 


5613-6989 


3 


3144-3473 


7432-7761 


3 


3474-3640 


8343-8509 


3 


3541-3895 


9165-9419 


3 


3896-4159 


9589-9852 


3 


4160-4644 


10281-10765 


3 


4645-8296 


11543-15194 


7 


1-231 


1001-1231 


7 


232-314 


10478-10560 


7 


315-404 


13843-13932 


7 


405-513 


21132-21240 


7 


514-599 


24356-24441 


7 


600-799 


27430-27629 


7 


800-906 


32356-32462 


7 


907-1054 


43028-43175 


7 


1055-1223 


44698-44866 


7 


1224-1632 


46039-46447 


7 


1633-3885 


48976-51228 


8 


1-231 


1001-1231 


8 


232-314 


10478-10560 


8 


315^04 


13843-13932 


8 


405-513 


21132-21240 


8 


514-599 


24356-24441 


8 


600-799 


27430-27629 


8 


800-906 


32356-32462 


8 


907-1054 


43028-43175 


8 


1055-1223 


44698-44866 


8 


1224-2823 


46039-46447 



EXAMPLE 2 
Materials and Experimental Procedures 
5 * RNA preparation — RNA was commercially obtained from Clontech 

(Franklin Lakes, NJ USA 07417, www.clontech.coin) or BioChain Inst. Inc. 
(www.biochain.com) or ABS or Clinomics. Alternatively RNA was purified from 
tissue samples using TRI-Reagent (Molecular Research Center), according to 
Manufacturer's instructions. Tissue samples were obtained from subjects or from 
10 postmortem. Total RNA samples were treated with DNasel (Ambion) then purified 
using RNeasy columns (Qiagen). 



WO 2004/012847 




PCT/IL2003/000636 



50 

Lung - Lu_lN-Lu_5N sample was prepared from commercially available 
normal lung RNAs (BioChain, Cat. No. CDP-061010, Lot Nos: A503205, A503384, 
A503385, A503204, A503206). RT Lu_6N was prepared from a pool of 6 normal lung 
RNAs (BioChain, Cat. No. CDP-061010, Lot No. A409363). 
5 LuJ7C~ Lu_16C samples were prepared from lung adenocarcinoma RNAs: 

samples LuJ7C- LuJLOC were prepared from commercially available RNAs 
(BioChain, Cat. No. CDP-064004A, Lot Nos: A504117, A504119, A504116, 
A504118), and samples Lu_llC- Lu_16C were prepared from RNA purified from 
tissue samples of subjects. 

10 RTs Lu_17C- Lu_31C were prepared from squamous cell carcinoma RNAs: 

samples Lu_17C- Lu_22C were prepared from commercial RNAs (BioChain, Cat. No. 
CDP-064004B, Lot Nos: A503187, A503386, A503387, A503183, A411075), sample 
Lu_27C was prepared from commercial RNA (Clontech, Cat No: 64013-1), samples 
Lu_28C - Lu_30C were prepared from commercial RNAs (BioChain, Cat. No. CDP- 

15 064004, Lot Nos: A409017, A409091, A408175), and samples Lu_23C- Lu_26C and 
Lu_3 1C were prepared from RNA purified from subjects tissue samples. 

Lu_32C- Lu_35C were prepared from commercial small cell carcinoma RNAs 
(BioChain, Cat. No. CDP-064004D, Lot Nos: A504115, A501390, A501389, 
A501391). 

20 Lu_36C- Lu_37C were prepared from commercial large cell carcinoma RNAs 

(BioChain, Cat. No. CDP-064004C, Lot Nos: A504113, A504114). 

Lu_38C was prepared from commercial alveolus cell carcinoma RNAs 
(BioChain, Cat. No. CDP-064004, Lot Nos: A409089). 

RT Lu_39C was prepared from lung carcinoma RNA purified from subject 
25 tissue sample (with no forther subcaracterization). 

Lu_40 H1299 was prepared from RNA purified from NCI_H1299 cell line 
non-small cell carcinoma (ATCC Catalog No: CRL-5803). 

SG_41 was prepared from commercial normal salivary gland RNA (pool of 24) 
(Clontech, Cat No: 64110-1). 
30 Lu_16N and Lu_25N were prepared from RNA purified from subjects normal 

tissue samples matched to the cancer samples Lu_16C and Lu_25C. 

Colon - RTs marked as "Col_xN" (Col_2N- Col_276N) were prepared from 
normal colon RNAs. Samples Col_22N and Col_23N were prepared from commercial 
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RNAs (BioChain, Cat. No. CDP-064007, Lot Nos: A501132, A501130), and the rest 
were prepared from RNA purified from normal subjects tissue samples. Normal pool 
sample was prepared form comercial RNA pool of normal colon (BioChain, Cat No. 
CDP-061003, Lot Nos: A411078). 
5 RTs Col 2C- Col_23C were prepared from colon adenocarcinoma RNAs, 

matched to the normal samples Col_2N- Col_23N. Samples Col_22C and Col_23C 
were prepared from commercial RNAs BioOiain, Cat. No. CDP-064007, Lot Nos: 
A501131, A501129), and the rest were prepared from RNA purified from subjects 
colon adenocarcinoma cancer tissue samples. 
10 Colon cell lines - Col-SW620 is epithelial colorectal adenocarcinoma from 

metastatic lymph node, Dukes C (ATCC Catalog No: CCL-227), Col-SW480 is 
epithelial colorectal adenocarcinoma, Dukes B (ATCC Catalog No: CCL-228) and 
Col-DLDl is epithelial colorectal adenocarcinoma, Dukes C (ATCC Catalog No: 
CCL-221). 

15 Ovary - Ovarian RNA was generated as described in Table 2, below. 



Table 2 



Serial number i 


Lot numbet t 


Source j 


Tissue I 


Pathology (grade) 


1-Pap Adeno G3 1 


LS-1406 i 


\BS ( 


>vary 


Dapillary adenocarcinoma (3) 


2-Pap Adeno G2 } 


LS-1408 i 


\BS ( 


jvary 


Dapillary adenocarcinoma (2) 


3-Pap Adeno G2 1 


□LS-1431 i 


\BS c 


jvary 


papillary adenocarcinoma (2) 


4-Pap CystAde G2 } 


CLS-7286 , 


r\BS 


yvary 


papillary cystadenocarcinoma (2) 


5-Adeno G3 


29-12-G432 


GOG 


Dvary 


adenocarcinoma (3) 


6-Adeno G3 


A.0106 


ABS 


ovary 


adenocarcinoma (3) 


7-Adeno G3 


LND-00375 


ABS 


ovary 


adenocarcinoma (3) 


8-Adeno G3 


A501U3 


BioChain 


ovary 


adenocarcinoma (3) 


9- Adeno G3 


99-06-G901 


GOG 


ovary 


adenocarcinoma (maybe serous) 
(3) 


10-Adeno G3 


A407069 


Biochain 


ovary 


adenocarcinoma (3) 


ll-Adeno G3 


A407068 


Biochain 


ovary 


adenocarcinoma (3) 


12-Adeno G3 


A406023 


Biochain 


ovary 


adenocarcinoma (3) 


13-Adeno G3 


94-05-7603 


GOG 


right ovary 


metastasis adenocarcinoma (3) 


14-Adeno G2 


A50U11 


BioChain 


ovary 


adenocarcinoma (2) 


15-Carcinoma G3 


A407065 


BioChain 


ovary 


carcinoma (3) 


16-Carcinoma 


109038 


Clontech 


ovary 


carcinoma NOS 


17-Muc Adeno G3 


A504084 


BioChain 


ovary 


mucinous adenocarcinoma (3) 


18-Muc Adeno G3 


A504083 


BioChain 


ovary 


mucinous adenocarcinoma (3) 


19-Muc Adeno G3 


A504085 


BioChain 


ovary 


mucinous adenocarcinoma 


20-Pap Muc CvstAde 


USA-0027: 


JABS 


ovary 


papillary mucinous 
cystadenocarcinoma 


21-Muc CystAde G2-3 


95-10-G02! 


OGOG 


ovary 


mucinous cystadenocarcinoma (2 



WO 2004/012847 




PCT/IL2003/000636 



52 



22-Muc CysiAde G2 


A0139 


ABS 


ovary 


mucinous cystadenocarcinoma (2) 


23-Muc CystAde G3 


VNM- 
00187 


ABS 


ovary 


mucinous cystadenocarcinoma 
with low malignant (3) 


24-Pap Sero Adeno G3 


2001-07- 
G801 


GOG 


ovary 


papillary serous adenocarcinoma 
(3) 


25-Pap Sero Adeno G3C 


N0021 


ABS 


ovary 


papillary serous adenocarcinoma 
(3C) 


26Sero Adeno G3 


2001-12- 
G035 


GOG 


right ovary 


serous adenocarcinoma (3) 


27-Pap Sero Card G3 


2001-08- 
G011 


GOG 


ovary 


papillary serous carcinoma (3) 


2o- jrap i>ero CystAde G3 


A503176 


rSioUnain 


ovary 


serous papillary 
cystadenocarcinoma (3) 


29- Pap Sero CystAde G3 


93-09-4901 


GOG 


ovary 


serous papillary 
cystadenocarcinoma (3) 


30- Pap Sero CystAde G3 


A503175 


BioChain 


ovary 


serous papillary 
cystadenocarcinoma (1) 


31-Pap Endo Adeno G3C 


95-04-2002 


GOG 


ovary 


papillary endometrioid 
adenocarcinoma (3C) 


32-Endo Adeno G2 


94-08-7604 


GOG ! 


right ovary 


endometrioid adenocarcinoma (2) 


33-Endo Adeno Gl-2 


2000-09- 
G621 


GOG 


ovary 


endometrial adenocarcinoma (1-2) 


34-Mix Sero/Endo G3 


2002-05- 
G513 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma (3) 


35-Mix Sero/Endo G3 


2002-05- 
G509 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma of mullerian (3) 


36-Mix Sero/Endo G3 


2001-12- 
G037 


GOG 


ovary 


mixed serous and endometrioid 
adenocarcinoma (3) 


37-Mbc Sero/Endo G2 


95-11-G006 


GOG 


ovary,endomett 
ium 


papillary serous and endometrioid 
cystadenocarcinoma (2) 


9 a hat* - c rn at _ rwTi j 

38-Mex Sero/Muc/Endo G2 


98-03-G803 


GOG 


ovary 


mixed epithelial 
cystadenocarcinoma with 
mucinous, endometrioid, 
squamous and papillary serous(2) 


39-Adeno borderline 


98-08-G001 


GOG 


ovary 


epithelial adenocarcinoma of 
borderline malignancy 


40-Clear cell Adeno G3 


2001-10- 
G002 


GOG 


ovary 


clear cell adenocarcinoma (3) 


41-Clear cell Adeno 


2001-07- 
G084 


GOG 


ovary 


clear cell adenocarcinoma 


42-N 


A503274 


BioChain 


ovary 


Normal 


43-N 


A504086 


BioChain 


ovary 


Normal 


44-N 


061P43A 


Ambion 


ovary 


Normal 


45-N 


A504087 


BioChain 


ovary 


Normal 


46-NM14 


A501112 


BioChain 


ovary 


Normal (matched tumor A501 1 1 V 


47-NM8 


A501114 


BioChain 


ovary 


Normal (matched tumor A50 1 1 1 3] 


48-NM38 


98-03- 
G803N 


GOG 


ovary 


Normal (matched tumor 98-03- 
G803) 


49-NM39 


98-08- 
G001N 


GOG 


ovary 


Normal (matched tumor 98-08- 
G001) 



RT reaction with oligo-dT - Reverse transcription was effected using 2 jag of 
total RNA, in a 20 pi reaction, including 200 units of Superscript II Reverse 
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Transcriptase (Bibco/BRL) in the buffer supplied by the manufacturer, 500 pmol of 
oligo(dT) 2 5 (Promega Corp. Madison WI, USA), and 40 units of RNasin (Promega 
Corp. Madison WI, USA). 

Real Time PCR - 5ul RT reaction products, diluted in final reaction volume of 
5 20ul was used for amplification. Specific oligonucleotides (SEQ ID NO:4 and SEQ 
ID NO:5) were used as primers. The ABI Prism 7000 Sequence Detection System was 
used for cycling. The reaction was effected using SYBR GreenPCR Master Mix 
(Applied Biosystems). The cycle in which the reactions achieved a threshold level 
(Ct) of fluorescence was registered and served to calculate the initial transcript 

10 quantity in the RT reaction. Control PCR reactions were effected on the same RT 
sample, using PCR primers specific to the house keeping gene ribosomal protein S27a 
(RPS27A). An amplicon fragment thereof (SEQ ID NO: 23) was generated using the 
primers set forth in SEQ ID NOs: 21 and 22. For each primer set, the value of each 
PCR reaction was divided into the value of one of the RTs (a pool of normal lung 

15 transcripts from BioChain). In order to normalize the results, the ratio for each PCR 
reaction was then divided by the house keeping gene ratio for the same RT. 

RT PCR for Examples 6-9 - 1 jxg of treated RNA was mixed with 150 ng 
Random Hexamer primers (Invitrogen) and 500 jxM dNTP in total volume of 15.6 fxl. 
The mixture was incubated for 5min at 65°C and then quickly chilled on ice. Then, 5jil 

20 5 X Superscriptn first strand buffer (Invitrogen), 2.4 p,l 0.1 M DTT and 40 units 
Rnasin (Promega) were added, and the mixture was incubated for 10 ruin at 25 °C, 
followed by further incubation at 42 °C for 2 min. Then, 1 jal (200 units) of 
Superscriptn (Invitrogen) was added and the reaction (final volume of 25 jxl) was 
incubated for 50 min at 42 °C and then inactivated at 70 °C for 15 min. The resulting 

25 cDNA was diluted 1 :20 in 1 0 mM TO. IE. 

Real-Time RT-PCR used for analyzing expression pattern of SEQ ID NOs. 
1, 9, 18 and 19 in different ovary and lung samples (Examples 7-9, Figures 8-11)- 
5fxl of diluted cDNA prepared with random primers were used as a template in Real- 
Time PCR reactions using the SYBR Green I assay (PE Applied Biosystem) with 

30 specific primers. The amplification stage was effected as follows, 50 °C for 2 min, 95 
°C for 10 min, 95 °C for 15 sec, followed by 60 °C for 1 min. Detection was effected 
using PE Applied Biosystem SDS 7000. The cycle in which the reactions achieved a 
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threshold level (Ct) of fluorescence was registered and served to calculate the initial 
transcript quantity in the RT reaction. The quantity was calculated using a standard 
curve created using serial dilutions of either a purified amplicon product or reverse 
transcription (RT) reaction prepared from RNA mix purified from 5 cell-lines 
5 (HCT1 1 6, H1299, DU145, MCF7, ES-2). To minimize inherent differences in the RT 
reaction, the resulting quantity was normalized to the geometric mean of the 
quantities of several housekeeping genes (different HSKP genes were used for the 
different tissue panels). 

RT-PCR analysis used for analyzing expression pattern ofSEQ ID NOs. 18 

10 and 19 in different breast samples (examples 10,11, Figures 12,13 ) - 5 \i\ of diluted 
cDNA prepared with random primers were used as a template in RT-PCR reactions 
using the SYBR Green I assay (PE Applied Biosystem) with specific primers. The 
amplification stage was effected as follows, 50 °C for 2min, 95 °C for 10 min, 95 °C 
for 15 sec, followed by 60 °C for 1 min. PCR products were analyzed by 1.8 % 

15 agarose gels. 

EXAMPLE 3 

Expression pattern of a SIM2 derived fragment (SEQ ID NO:20) in normal and 

malignant lung samples 

20 The expression level of a SIM2 derived fragment corresponding to SEQ ID 

NO:20, which is a fragment of SEQ ID NO: 1 (nucleotide coordinates 125-225, see 
Figure 1) was determined using the primers set forth in SEQ ID NOs: 4 and 5 
(designated as primers 1 and 2 in Figure 1) and measured by real time PCR. The 
expression of the housekeeping gene RPS27A (GenBank Accession No. NM_002954, 

25 SEQ ID NO: 23) was determined similarly using primers: SEQ ID NOs: 21, 22. 
Expression values were first normalized to a housekeeping gene. The expression was 
then calculated relative to a pool of normal lung samples (Lu_6N). 

As shown in Figure 2, representing two duplicates of the same experiment, the 
expression of the SIM2 fragment (SEQ ID NO: 1) in the normal samples was 

30 significantly lower than in the tumor samples. Interestingly, high expression was 
found in adenocarcinoma samples (4 out of the 9 samples) and squamous cell 
carcinoma samples (7 out of the 14 samples). The expression in these samples was 
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between 10 and 200 fold higher as compared to the expression pattern in normal 
samples. 

EXAMPLE 4 

Expression pattern of a SIM2 derived fragment (SEQ ZD NO:20) in normal and 

malignant colon samples 
The expression levels of a SIM2 derived fragment corresponding to SEQ ID 
NO. 20 (a portion of SEQ ID NO: 1 amplified by the primers set forth in SEQ ID NOs. 
4-5) as well as the housekeeping gene RPS27A (GenBank Accession No. 
NMJ)02954, SEQ ID NO: 23) using primers: SEQ ID NOs: 21, 22) were measured by 
real time PCR. Each value was first normalized to the housekeeping gene. The 
expression level was then calculated relative to a pool of RNA from normal colon 
(Col-normal pool). 

As shown in Figure 3, representing two duplicates of the same experiment, the 
expression in most of the adenocarcinoma and cell lines samples was higher than in 
the normal samples. 

EXAMPLE 5 

Expression pattern of a SIM2 derived fragment (SEQ ID NO: 9) in normal and 
20 malignant lung samples 

Expressions of long (SEQ ID NO: 7) and short (SEQ ID NO: 8) SIM variants 
was measured by real time PCR using a sequence fragment (SEQ ID NO: 9, and the 
primers set forth in SEQ ID NOs: 10 and 11, designated as primers 9 and 10 in Figure 
1, respetively) which is shared by both sequences. Each expression was first 
25 normalized to RPS27A (SEQ ID NO: 23; using primers: SEQ ID NOs: 21, 22) 
(Figures 4 through 7). Then the expression level was calculated relative to a pool of 
normal lungs (Lu_6N). 

As shown in Figures 4 through 7, SIM2 expression in normal samples was very 
low. High expression was found in adenocarcinoma samples and squamous cell 
30 carcinoma samples. Note that over-expression in tumor samples was 10 to 2000 fold 
relatively to the expression of SIM2 in normal samples. Figures 5 and 7 show a better 
resolution pattern of SIM2 expression on a scale of 0-200. 



10 
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EXAMPLE 6 

Expression of SIM2 derived fragments in normal and cancerous ovary tissues 
The expression of two different SIM2-derived sequences (SEQ ID NOs:20 and 
9) and three housekeeping genes - PBGD (GenBank Accession No: HSPBGDR, SEQ 

5 ID NO: 32; using the primers set forth in SEQ ID NOs: 30, 31), ATP-6-syn (GenBank 
Accession No: NM_1733702, SEQ ID NO: 26; using the primers set forth in SEQ ID 
NOs: 24, 25), 18s ribosomal RNA (GenBank Accession No: HSRRN18S, SEQ ID 
NO: 29; using the primers set forth in SEQ ID NOs: 27, 28) were measured by real 
time PCR. In each RT sample, the expression of S1M2 sequences was normalized to 

10 the geometric mean of the quantities of three housekeeping genes PBGD, ATP-6-syn, 
18s ribosomal RNA, as detailed in Example 2, hereinabove. The normalized quantity 
of each RT sample was then divided by the normalized quantity of a normal sample 
(No. 45, Table 2 above). 

As shown in Figure 8, SIM2 expression in normal samples (samples nos. 42- 

15 49, Table 2 above) was significantly lower than in the cancer samples. Notably, the 
highest expression of SIM2 was found in papilary serous (carcinoma or 
adenocarcinoma or cystadenocarcinoma) samples (samples 24, 27, 29, Table2). 

EXAMPLE 7 

20 Expression of SIM2 long variant - derived fragment (SEQ W NO: 18) in normal and 

cancerous ovary tissues 
Expression of S1M2 long variant (GenBank Accession No: gi7108363, SEQ ID 
NO: 7) was measured by real time PCR using a fragment (SEQ ID NO: 18 
corresponding to nucleotide coordinates 1551*1670 of SEQ ID NO: 7 using the 

25 primers set forth in SEQ ID NOs:14-15, designated as primers 12 and 13 in Figure 1, 
respectively). In addition the expression of two housekeeping genes - PBGD 
(GenBank Accession No: HSPBGDR, SEQ ID NO: 32; vising the primers set forth in 
SEQ ID NOs: 30-31) and HPRT1 (GenBank Accession No: GI_32449, SEQ ID NO: 
35; using the primers set forth SEQ ID NOs: 33-34), was measured by real time PCR. 

30 In each RT sample, the expression of SIM2 sequences was normalized to the 
geometric mean of the quantities of the housekeeping genes as detailed in Example 2, 
hereinabove. The normalized quantity of each RT sample was then divided by the 
averaged quantity of the normal samples (No. 42-48, Table 2). 
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As shown in Figure 9, SIM2 expression in normal samples (sample Nos. 42- 
48, Table 2) was significantly lower than in the cancer samples. Notably, the highest 
expression of SIM2 was found in 4 out of 6 papillary serous samples (carcinoma or 
adenocarcinoma or cystadenocarcinoma). 

EXAMPLE 8 

Expression ofSIM2long variant - derived fragment (SEQ ID NO:18) in normal and 

cancerous lung tissues 
Expression of SIM2 long variant (SEQ ID NO: 7) was measured in normal and 
cancerous lung tissues (see Table 3, below) by real time PCR using a sequence 
fragment (SEQ ID NO: 18 corresponding to nucleotide coordinates 1551-1670 of SEQ 
ID NO: 7 using the primers set forth in SEQ ID NOs:14-15, designated as primers 12 
and 13 in Figure 1, respectively). In addition the expression of three housekeeping 
genes - SDHA (GenBank Accession No: NM_004168, SEQ ID NO: 38 was measured 
using the primers set forth in SEQ ID NOs: 36 and 37), RPS27A (GenBank Accession 
No: NM_002954, SEQ ID NO: 23 was measured using the primers set forth in SEQ 
ID NOs: 21, 22), PBGD (GenBank Accession No: HSPBGDR, SEQ ID NO: 32; using 
the primers set forth in SEQ ID NOs: 30, 31), was measured by real time PCR. In 
each RT sample, the expression of SIM2 sequences was normalized to the geometric 
mean of the quantities of the housekeeping genes as detailed in Example 2, 
hereinabove. The normalized quantity of each RT sample was then divided by the 
averaged quantity of the normal samples (No. 46-54, Table 3). 



Table 3 



Serial number 


Lot number 


Pathology 


Source 


1 


A504117 


Adenocarcinoma 


Biochain 


2 


A504118 


Adenocarcinoma 


Biochain 


3 


CG-200 


Adenocarcinoma 


Ichilov 


4 


Com-02-43T-M-2237T 


Adenocarcinoma Grade 1 


Clinomics 


5 


Com-02-49T-M-2214T 


Adenocarcinoma Grade 2 


Clinomics 


6 


Com-02-55T-M-2206T 


Adenocarcinoma Grade 3 


Clinomics 


7 * 


Com-02-57T-M-2285T 


Adenocarcinoma Grade 4 


Clinomics 


8 


Com-02-59T-M-2261T 


Adenocarcinoma Grade 2 


Clinomics 


9 


Com-02-41T-M-2269T 


Adenocarcinoma Grade 3 


Clinomics 


11 


Com-02-53T-M-2221T 


Adenocarcinoma Grade 1 


Clinomics 


12 


A504119 


Moderately adenocarcinoma 


Biochain 


13 


A504116 


moderately to poorly adenocarcinoma 


Biochain 


14 


CG-111 


Adenocarcinoma 


Ichilov 1 
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15 


CG-244 


Bronchioloalveolar adenocarcinoma 


Ichilov 


16 


A409091 


Moderately squamous 


Biochain 


17 


A503183 


moderately squamous 


Biochain 


18 


A503387 


moderately to poorly squamous 


Biochain 


19 


A408175 


Squamous 


Biochain 


20 


A501121 


Squamous 


Biochain 


21 


A503187 


Squamous 


Biochain 


22 


A503386 


Squamous 


Biochain 


23 


CG-109 (1) 


Squamous 


Ichilov 


24 


CG-123 


Squamous 


Ichilov 


25 


CG-204 


Squamous 


Ichilov 


26 


Com-02-47T-M-2208T 


Squamous Grade 3 


Clinomics 


27 


Com-02-61T-M-2215T 


Squamous Grade 2 


Clinomics 


28 


Com-02-63T-M-22 1 6T 


Squamous Grade 3 


Clinomics 


29 


Com-02-65T-M-2239T 


Squamous Grade 1 


Clinomics 


30 


A501389 


Small cell 


Biochain 


31 


A501390 


Small cell 


Biochain 


32 


A501391 


Small cell 


Biochain 


33 


A504115 


Small cell 


Biochain 


34 


Com-02-45T-M-22 1 7T 


Small cell Grade 2 


Clinomics 


35 


Com-02-69T-M-22 10T 


Small cell Grade 3 


Clinomics 


36 


O>m-02-71T-M-2218T 


Small cell Grade 2 


Clinomics 


37 


Com-02-73T-M-2235T 


Small cell Grade 1 


Clinomics 


38 


A504113 


large cell 


Biochain 


39 


A504114 


large cell 


Biochain 


40 


Com-02-75T-M-2212T 


Large cell Grade 3 


Clinomics 


41 


Com-02-77T-M-2257T 


Large cell Grade 4 


Clinomics 


42 


Com-02-79T-M-224 IT 


Large cell Grade 2 


Clinomics 


43 


Com-02-1 63T-M-2290T 


Large cell Grade 1 


Clinomics 


44 


A501123 


Moderately alveolus carcinoma 


Biochain 


45 


A501221 


Alveolus carcinoma 


Biochain 


46 


A501124 


Normal 


Biochain 


47 


A503205 


Normal 


Biochain 


48 


A503206 


Normal 


Biochain 


49 


A503384 


Normal 


Biochain 


51 


Com-02-44N-M-2237N 


normal M4 


Clinomics 


53 


Com-02-42N-M-2269N 


normal M9 


Clinomics 


54 


Com-02-48N-M-2208N 


normal M26 


Clinomics 



Table 3 Cont 



As shown in Figure 10, SIM2 expression in normal samples (sample Nos. 46- 
54, Table 3) was significantly lower than in the cancer samples. Interestingly, high 
5 expression was found in adenocarcinoma samples (7 out of the 15 samples) and 
squamous cell carcinoma samples (9 out of the 14 samples). 
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EXAMPLE 9 

Expression ofSIM2short variant- derived fragment (SEQ W NO: 19) in normal and 

cancerous ovary tissues 

Expression of SIM2 short variant (GenBank Accession No: gi7108361, SEQ 
ID NO: 8) was measured by real time PCR using the sequence fragment set forth in 
SEQ ID NO: 19 (nucleotide coordinates 1224-2324 of SEQ ID NO: 7, amplified using 
the primers set forth in SEQ ID NOs: 16-17 designated as primers 14 and 15 of Figure 
1, respectively). In addition the expression of two housekeeping genes - PBGD was 
measured by real time PCR as described above. In each RT sample, the expression of 
SIM2 sequences was normalized to the geometric mean of the quantities of the 
housekeeping genes as detailed in Example 2, hereinabove. The normalized quantity 
of each RT sample was then divided by the averaged quantity of the normal samples 
(No. 42-49, Table 2 above). 

As shown in Figure 11, SIM2 expression in normal samples (sample Nos. 42- 
49, Table 2 above) was significantly lower than in the cancer samples. 

EXAMPLE 10 

Expression ofSIM2 long variant - derived fragment (SEQ ID NO:18) in normal 

and cancerous breast tissues 
Expression of SIM2 short variant (SEQ ID NO: 7) was evaluated by RT-PCR 
using a sequence fragment (SEQ ID NO: 18, described above). As shown in Figure 
12, SIM2 expression in most tumor samples was higher than in the normal samples 
(Table 4, below) . 

EXAMPLE 11 

Expression ofSJM2 short variant - derived fragment (SEQ ID NO:19) in normal 

and cancerous breast tissues 
Expression of SIM2 short variant (SEQ ID NO: 8) was evaluated by RT-PCR 
of a sequence fragment (SEQ ID NO: 19, primers 14 and 15, see Figure 1) normal and 
cancerous breast samples (see Table 4, below) 
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Table 4 



Serial number 


Cat No. , 


Pathology 


yource 


1 1 


M-0140T J 


DCIS Grade 1 


clinomics 


2 


h ** /\ ■* Ann 

M-0110T ] 


DCIS Grade 2 


-linomics 


3 


M-0150T 


[DC Grade 1 


3-linomics 


4 


M-2159T 


[DC Grade 1 


clinomics 


5 


M-2168T 


IDC Grade 1 


clinomics 


6 


723 8T 


IDC-G1 


ABS 


7 


7263T 


EDC-G2 


ABS 


8 


M-0113T 


EDC Grade 2 


clinomics 


9 


M-0160T 


IDC Grade 2 


clinomics 


10 


M-2160T 


IDC Grade 2 


clinomics 


11 


M-2175T 


IDC Grade 2 


i • • _ 

clinomics 


12 


1432T 


IDC-G2 


ABS 


13 


A0133T 


IDC-G2 


ABS 


14 


A0135T 


IDC-G2 


ABS 


15 


7259T 


IDC-G2 


ABS 


16 


20032T 


IDC-G2 


ABS 


17 


20036T 


IDC - G2(3) 


ABS 


18 


M-2169T 


IDC Grade 2 


clinomics 


19 


M-2162T 


IDC Grade 2 


clinomics 


20 


M-0111T 


IDC Grade 3 


clinomics 


21 


M-0112T 


IDC Grade 3 


clinomics 


22 


M-0114T 


IDC Grade 3 


clinomics 


23 


M-0115T 


IDC Grade 3 


clinomics 


24 


M-0180T 


EDC Grade 3 


clinomics 


25 


M-2176T 


T'T*\ >^ y» * O 

IDC Grade 3 


clinomics 


26 


7249T 


IDC-G3 


ABS 


27 


20072T 


DDC-G3 


ABS 


28 


M-2161T 


IDC Grade 3 


clinomics 


29 


M-2170T 


IDC Grade 3 


clinomics 


30 


M-2177T 


IDC Grade 3 


clinomics 


31 


CG-154 


EDC 


T_l_ *1 

Icnilov 


32 


7116T 


Mucinous carcinoma 


ABS 


33 


M-0140N 


normal matched to IT 


clinomics 


34 


TV>r All AVT 


normal maicnea to zi 


C llllOIIUkxO 


35 


7238N 


normal matched to 6T 


ABS 


36 


7263N 


normal matched to 7T 


ABS 


37 


M-0150N 


normal matched to 3T 


clinomics 


38 


7116N 


normal matched to 32T 


ABS 


39 


7259N 


normal mathed to 15T 


ABS 


40 


1432N 


normal mathed to 12T 


ABS 


41 


7249N 


normal mathed to 26T 


ABS 


42 


2003 IT 


IDC Grade 3 


ABS 



IDC= Invasive Ductal Carcinoma 
DCIS= Ductal Carcinoma In Situ 
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As shown in Figure 13, SIM2 expression in normal samples (sample Nos. 33- 
41, Table 4 above) was significantly lower than in the cancer samples. 

It is appreciated that certain features of the invention, which are, for clarity, 
5 described in the context of separate embodiments, may also be provided in 
combination in a single embodiment. Conversely, various features of the invention, 
which are, for brevity, described in the context of a single embodiment, may also be 
provided separately or in any suitable subcombination. 

10 Although the invention has been described in conjunction with specific 

embodiments thereof, it is evident that many alternatives, modifications and variations 
will be apparent to those skilled in the art. Accordingly, it is intended to embrace all 
such alternatives, modifications and variations that fall within the spirit and broad 
scope of the appended claims. All publications, patents and patent applications 

15 mentioned in this specification are herein incorporated in their entirety by reference 
into the specification, to the same extent as if each individual publication, patent or 
patent application was specifically and individually indicated to be incorporated herein 
by reference. In addition, citation or identification of any reference in this application 
shall not be construed as an admission that such reference is available as prior art to 

20 the present invention. 
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1. A method of diagnosing predisposition to, or presence of ovarian 
cancer, breast cancer and/or lung cancer in a subject, the method comprising 
determining a level of SIM2 in a biological sample obtained from the subject, said 
level being correctable with predisposition to, or presence or absence of the ovarian 
cancer, breast cancer and/or lung cancer, thereby diagnosing predisposition to, or 
presence of ovarian cancer, breast cancer and/or lung cancer in the subject. 

2. The method of claim 1, wherein said biological sample is a tissue 
sample and/or a body fluid sample. 

3. The method of claim 2, wherein said tissue sample is selected from the 
group consisting of an ovarian tissue, a lung tissue and a breast tissue. 

4. The method of claim 1, wherein said SIM2 is selected from the group 
consisting of SEQ ID NOs: 1, 2, 3, 7, 8 and 9. 

5. The method of claim 1, wherein said determining level of said SIM2 is 
effected at an mRNA level. 

6. The method of claim 1, wherein said determining level of said SIM2 is 
effected at a protein level. 

7. The method of claim 1, wherein said determining level of said SIM2 is 
effected at a gene amplification level. 

8. A method of treating ovarian cancer, breast cancer and/or lung cancer 
in a subject, the method comprising downregulating expression or activity of S1M2 in 
a lung tissue, breast tissue and/or an ovarian tissue, thereby treating the ovarian cancer, 
breast cancer and/or lung cancer in the subject. 
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9. The method of claim 8 wherein said SDVL2 is selected from the group 
consisting of SEQ ID NOs: 1, 2, 3, 7, 8 and 9. 

10. The method of claim 8, wherein downregulating expression or activity 
of said SIM2 is effected hy administering to the subject: 

(a) a molecule which binds SIM2; 

(b) an enzyme which cleaves SIM2; 

(c) an antisense polynucleotide capable of specifically hybridizing with 
an mRNA transcript encoding SIM2; 

(d) a ribozyme which specifically cleaves SIM2 transcripts; 

(e) a non-functional analogue of at least a catalytic or binding portion of 
SIM2; 

(f) a molecule which prevents S1M2 activation or substrate binding; 

(g) an siRNA molecule capable of inducing degradation of SIM2 
transcripts; 

(h) a DNAzyme which specifically cleaves SIM2 transcripts or DNA; and 

(i) a molecule which promotes a SIM2-specific immunogenic response. 

11. The method of claim 10, wherein said molecule which binds SIM2 is an 
antibody or antibody fragment capable of specifically binding said SIM2. 

12. Use of an agent capable of downregulating SIM2 expression or activity 
for the treatment of ovarian, breast and/or lung cancer. 

13. The use of claim 12, wherein said agent capable of downregulating 
SIM2 activity is an antibody or antibody fragment. 

14. The use of claim 12, wherein said agent capable of downregulating 
SIM2 expression or activity is an oligonucleotide. 



15. The use of claim 14, wherein said oligonucleotide is a single or 
double stranded polynucleotide. 
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16. The use of claim 14, wherein said oligonucleotide is at least 17 bases 

long. 

17. The use of claim 14, wherein said oligonucleotide is hybridizable in 
either sense or antisense orientation. 

18. Use of a SIM2 detecting agent for detecting ovarian, breast and/or lung 

cancer. 

19. The use of claim 18, wherein said agent for detecting ovarian, breast 
and/or lung cancer is an oligonucleotide . 

20. The use of claim 18, wherein said agent for detecting ovarian, breast 
and/or lung cancer is an antibody or antibody fragment. 

21. The use of claim 18, wherein said agent for detecting ovarian, breast 
and/or lung cancer is coupled to a detectable moiety selected from the group 
consisting of a chromogenic moiety, a fluorogenic moiety, a radioactive moiety and a 
light-emitting moiety. 

22. An article-of-manufacture comprising a packaging material and a 
composition identified for treating ovarian, breast and/or lung cancer being contained 
within said packaging material, said composition including, as an active ingredient, an 
agent capable of downregulating SIM2 expression or activity. 

23. The article-of-manufacture of claim 22, wherein said agent capable of 
downregulating SIM2 activity is an antibody or antibody fragment. 

24. The article-of-manufacture of claim 22, wherein said agent capable of 
downregulating SIM2 expression or activity is an oligonucleotide. 



25. The article-of-manufacture of claim 24, wherein said oligonucleotide 
is a single or double stranded polynucleotide. 
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26. The article-of-mannfacture of claim 24, wherein said oligonucleotide 
is at least 17 bases long. 

27. The article-of-manufacture of claim 24, wherein said oligonucleotide 
is hybridizable in either sense or antisense orientation. 

28. The article-of-manufacture of claim 22, wherein said agent capable of 
downregulating SIM2 expression or activity is an antibody or antibody fragment. 

29. The article-of-manufacture of claim 22, wherein said S1M2 is selected 
from the group consisting of SEQ ID NOs: 1, 2, 3, 7, 8 and 9. 

30. An isolated polynucleotide comprising a nucleic acid sequence 
encoding a polypeptide being at least 80 % homologous to SEQ ID NO: 39, 40 or 41 
as determined using the BestFit software of the Wisconsin sequence analysis package, 
utilizing the Smith and Waterman algorithm, where the gap creation equals 8 and gap 
extension penalty equals 2. 

31. The isolated polynucleotide of claim 30, wherein said polypeptide is as 
set forth in SEQ ID NO: 39, 40 or 41. 

32. An isolated polynucleotide comprising a nucleic acid sequence being 
80 % identical to SEQ ID NO: 39, 40 or 41, as determined using the BestFit software 
of the Wisconsin sequence analysis package, utilizing the Smith and Waterman 
algorithm, where gap weight equals 50, length weight equals 3, average match equals 
10 and average mismatch equals -9. 

33. The isolated polynucleotide of claim 32, wherein said nucleic acid 
sequence is as set forth in SEQ BO NO: 2 or 3. 



34. An isolated polynucleotide as set forth in SEQ ID NO: 2 or 3. 
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35. 

claim 30. 

36. An isolated polypeptide as set forth in SEQ ID NO: 39, 40 or 41. 

37. A method of diagnosing predisposition to, or presence of cancer in a 
subject, the method comprising determining a level of SEQ ID NO: 2 and/or 3 in a 
biological sample obtained from the subject, wherein said biological sample is 
suspected of being a cancerous tissue or associated with said cancerous tissue and 
whereas said level being correlatable with predisposition to, or presence or absence of 
the cancer, thereby diagnosing predisposition to, or presence of cancer in the subject. 

38. The method of claim 37, wherein said determining level of said SEQ ID 
NO: 2 and/or 3 is effected at an mKNA level. 

39. The method of claim 37, wherein said determining level of said SEQ ID 
NO: 2 and/or 3 is effected at a protein level. 

40. The method of cl aim 37, wherein said determining level of said SEQ ID 
NO: 2 and/or 3 is effected at a gene amplification level. 

41. A method of treating cancer in a subject, the method comprising 
downregulating expression or activity of SEQ ID NO: 2 and/or 3 in a cancerous tissue, 
thereby treating the cancer in the subject. 



PCT/IL2003/000636 



66 

A nucleic acid construct comprising the isolated polynucleotide of 
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SEQUENCE LISTING 

<110> Hermesh, Chen 
Walach , Shira 
Rotman , Galit 
Sela-Tavor, Osnat 

<120> SIM2 POLYPEPTIDES AND POLYNUCLEOTIDES ENCODING SAME AND OSES 

THEREOF IN DIAGNOSIS AND TREATMENT OF OVARIAN, BREAST AND LUNG 
CANCERS ' 

<130> 26340 

<160> 41 

<170> Patentln version 3.2 

<210> 1 

<211> 1001 

<212> DNA 

<213> Homo sapiens 



<400> 1 
ggaatattcg 


aaaccccgag 


cttttacaac 


ataaagcgca 


tggtgtggcc gcggcgggta 


60 


atggcgctct 


gggagccctg 


cccaggcggc 


j— » -I— X> +»• f~* 4~ ^1 /T 

Ct-CLigCtCy c 


cctcctccac 


ttccagctcc 


j. u 


gagctgggtg 


tgttgcaagt 


ttcatactcc 


tacatattat 


aagtgacact 


aatatcaggg 


180 


acaactaagt 


gctggggaac 


ttcaatgaaa 


acctggctgg 


taaagtcaac 


acccccagac 


240 


ttctctgtgc 


tacatttctt 


taattaattc 


cggagtggtg 


tgtggacggg 


cgtctttgca 


300 


gttattatac 


acgtaagtga 


attaggccat 


ttgaagctac 


gaagtcatac 


ccaacatttt 


360 


ccattaagaa 


tattattttt 




r*t" cinca arft" 

i_ y y v_- o. u. i» i» 


ttagaattta 


attatgataa 


420 


ttttcctctt 


ttcctcatta 


tcccagatat 


ggctggttgt 


gagatacttt 


ttcactaaat 


480 


gtgtcttttt 


aatgattttg 


gaattaagca 


agtatgccaa 


atgcgccaag 


acatttataa 


540 


ctttagaaat 


tgctgtatag 


tatatatttt 


tggaacacca 


caggtttagt 


tgggaaaata 


600 


ttttgcagct 


gagttagaaa 


cttgaaagtt 


aggcttataa 


tcaagatgct 


gattttcaac 


660 


cttagcatcg 


gggaaggtaa 


tgatagttta 


gttggcaaag 


actttttgca 


gcaaactgta 


720 


tttgagacag 


cagaatccaa 


ggatatcttt 


caagattcac 


ttatactaca ttctttttag 


780 


ccccctctct 


aggggtggag 


ggggtggctt 


agaaaaacca 


aaggtaatct 


ggtttcaatt 


840 


acatgctgta 


aaaatagaat 


ttgtggccag 


aaattaattt 


ggaatatttt 


ttatgggggc 


900 


aacattgtgg 


gttgtatgag 


tctttcacca 


actttattgc 


ttttctttgg ttctggatct 


960 


aaaatatgaa 


tgagtaaata 


aaatacagtt 


tcctttttca 


a 




1001 


<210> 2 

<211> 775 

<212> DNA 

<213> Homo sapiens 












<400> 2 
tttacaacat 


aaagcgcatg 


gtgtggccgc 


ggcgggtaat 


ggcgctctgg gagccctgcc 


60 


caggcggcct 


ctgctcgccc 


tcctccactt 


ccagctccga 


gctgggtgtg 


ttgcaagttt 


120 
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catactccta catattataa gtgacactaa tatcagggac aactaagtgc tggggaactt 
caatgaaaac ctggctggta aagtcaacac ccccagactt ctctgtgcta catttcttta 
attaattccg gagtggtgtg tggacgggcg tctttgcagt tattatacac actttggatg 
gatttgtttt tgtggtagca tctgatggca aaatcatgta tatatccgag accgcttctg 
tccatttagg cttatcccag gtggagctca cgggcaacag tatttatgaa tacatccatc 
cttctgacca cgatgagatg accgctgtcc tcacggccca ccagccgctg caccaccacc 
tgctccaaga gtatgagata gagaggtcgt tctttcttcg aatgaaatgt gtcttggcga 
aaaggaacgc gggcctgacc tgcagcggat acaaggtcat ccactgcagt ggctacttga 
agatcaggca gtatatgctg gacatgtccc tgtacgactc ctgctaccag attgtggggc 
tggtggccgt gggccagtcg ctgccaccca gtgccatcac cgagatcaag ctgtacagta 
acatgttcat gttcagggcc agccttgacc tgaagctgat attcctggat tccag 

<210> 3 

<211> 8296 

<212> DNA 

<213> Homo sapiens 

<400> 3 

ggggctccgc gggcctggag cacggccggg tctaatatgc ccggagccga ggcgcgatga 
aggagaagtc caagaatgcg gccaagacca ggagggagaa ggaaaatggc gagttttacg 
agcttgccaa gctgctcccg ctgccgtcgg ccatcacttc gcagctggac aaagcgtcca 
tcatccgcct caccacgagc tacctgaaga tgcgcgccgt cttccccgaa ggtgaggcct 
caggtgggcg gccggggacg ctggggagcc cggcggcccc ggcccaggcg ggaagcgcaa 
gccagcccgc ccagaggggt tgccgcggcc tggcgtccag agctggggcg tctgagggag 
gttgcgtgag ggtcttcggc ttcggcgctg gcttggggcg aggggccagg gccttggcgg 
cccaggcgac caaaccctct cctggtccag ggctgggtga gggcgaatta cgaattgttc 
caggggcagg cagtccccca gcccgcacgg ccagcgagcg ctgcgagtca gcggggatca 
cggtgaggcc caagcactgc aggctgaggc cacagagcga acacttgtgc tgagccgggc 
cctctcgtga ggctggggtg cgggaagtcc gggcaggaga gacccgcccc cgccgttgct 
gagctgagac ccggctgaaa gagaggggtc cgattaattc gaaaatggca gacagagctg 
agcgctgccg ttcttttcag gattgaaaat gtgccagtgg gccaggggcg ctgggacccg 
cggtgcggaa gactcggaac aggaagaaat agtggcgcgc tgggtgggct gccccgccgc 
ccacgccggt tgccgctggt gacagtggct gcccggccag gcacctccga gcagcaggtc 
tgagcgtttt tggcgtccca agcgttccgg gccgcgtctt ccagagcctc tgctcccagc 
ggggtcgctg cggcctggcc cgaaggattt gactctttgc tgggaggcgc gctgctcagg 
gttctggtgg gtcctctggg cccaggagct gggagggctg cgccggcctc tggagccccg 
ggagccagtg ccgaggtagg gagacaactt ccgccgcagg gcgccggacg gtcggggcag 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

775 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
10.80 
1140 
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agcaggcgac aggtgtccct aggccgcagg gcgcttccat agcgccatcc ccaccaggca 
ctctactcga aatcggaaag ctcgaccttt tgcgttcgcc tctgccaagc ctgttatttg 
tgctggccgc tgggtctgga gctgcgcttc tcggcccctc cccggtggag cgcagagggc 
tggtctgcaa gcgcggcctc cagccccgcg gctccccggc ccaggagcca ggcgcgggct 
gacccgggag cacccggcag cggagggggc tggaagcgga ccctaggcct ctcctgtgcc 
acccggccct accgcgcggc cgcggggcgc tctcctctcg ggcgcagcgg tccttcagcc 
cagggcaggt tcctcccttt cctactcgga acgtggcaaa gataccccag tcccagcccc 
tccagctgag agctgttgcc caaggtcgtc gctacttgtc cgctcaatgg tgaccccttg 
gcagagaact agggatgatt ccactccggt tgatgtttta ggggaaatta aaagaacatt 
cggttttctg agtctccttc cggggaggcg tggtggtaac tggtttgctg ggaagagccg 
ttccttaacc gcatgcaaca aagcagggcg tcagagatgt gaggcaattc tctacctccg 
ctggaaaaaa tgcagattta ttaaaggtcg actgtttagc agaacaacgt agatttttta 
caacgctttc cccgtctctg ctttgaagcc tgccaggctg cagctgggga tccaggaggg 
aaagcccgca ggcgcagagg ggacaatccg ggaagtggta aaggggacac ccgggcacag 
ggcctgtgct ttcgttgcag gcgaggaagt ggagcgcgcg ctgcagattc agcgcggggc 
tagaggaggg gacctggatc cctgaacccc ggggcggaaa gggagcctcc gggcggctgt 
gggtgccgcg ctcctcggag ccagcagctg ctggggcggc gtccgaactc cccaggtctg 
cgcacggcaa tgggggcacc gggccttctg tctgtcctca gaatacgtag gatacccgcg 
ggcgacaagc cgggccaggc taggagcctc cttccctgcc cctccccatc ggccgcggga 
ggctttcttg gggcgtcccc acgaccaccc ccttctcacc cggtccccag tttggaaaaa 
ggcgcaagaa gcgggctttt cagggacccc ggggagaaca cgagggctcc gacgcgggag 
aaggattgaa gcgtgcagag gcgccccaaa ttgcgacaat ttactgggat ccttttgtgg 
ggaaaggagg cttagaggct caagctatag gctgtcctag agcaactagg cgagaacctg 
gccccaaact ccctccttac gccctggcac aggttcccgg cgactggtgt tcccaaggga 
gccccctgag cctaccgccc ttgcaggggg tcgtgctgcg gcttctgggt cataaacgcc 
gaggtcgggg gtggcggagc tgtagaggct gcccgcgcag aaagctccag gatcccaata 
tgtgcttgcg tggagcaggg agcggaagag gcagccggtc ctcaccctcc tctcccgcca 
cgcacatatc cttcttgact tcgaagtggt ttgcaatccg aaagtgagac cttgagtcct 
cagatggccg gcaacgcgcc gaggtcacgc tccccagaaa cacccctctc ccctccccta 
ccccagctcc ccctggggcg ggtggtaatt gggggaggag aggccgcagg cagggaaggg 
gtgggaaagc cagagaggga ggcacaaagt gatggcagcc cggcaaacac tggggcttcg 
ggctgggccg cgctcgttta atcccacaaa aatcccattt tggaggtgag aaatagaggt 
tagaggtcgg gcccttctgg agatcagacc gaggagacgg gcccagctgg cgtcttaaag 



1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 



WO 2004/012847 



PCT/IL2003/000636 



caaggagggg 


gagtcgggag 


gaggaccagc 


4 

ttcggaagag 


gagggatcgc 


ttggaggccg 


3180 


tgcagtgtga 


ggaacggcag 


gcagggtgtg 


ggaccaacat 


gcacacactc 


gcaggtgctg 


3240 


gggccaggga 


ggaatgaggc 


gctggctccc 


tttccctcca 


tttctccctg 


ggggtcccag 


3300 


caacctggcc 


atccctgact 


tccaacagca 


cagcgtcccc 


acaggtcctg 


cagtgctctg 


3360 


caggggtgca 


gggagctccc 


ctccccccag 


ccgcaacctc 


accttcctca 


cccccacccc 


3420 


tccggcagga 


aaccacaggc 


tgggttgggg 


acccctggtg 


ctccaagaga 


gcagtgcaga 


3480 


tttctcccca 


agtcttcccg 


ccatgggctt 


tgcaagaagc 


cagggcccag 


aggccacgct 


3540 


caccgttaac 


actgcacagg 


gcaaaggtgg 


ctccaggaca 


actgcccaac 


cccaggaacg 


3600 


acccagcagc 


agagaaaagg 


acagctgcca 


gggtgccttt 


gttccccgtc 


cctcgtatcc 


3660 


cgcgctgccc 


gggggctcct 


gcctttggtt 


cagtgctcgc 


ggcaccaccg 


cactcaggac 


3720 


ggcagtgggg 


ggctggggct 


ggggctgggc 


ctggcccagc 


gtgggttggg 


gcgggggacg 


3780 


cgccagcagc 


gcccgcagct 


cgctccgcag 


gggtcgcagc 


caggggtcgg 


gagctaggct 


3840 


cgtgggccgg 


gagacgccgg 


gcgcgttgtc 


ctccggggag 


gttggggtgc 


aggcggggaa 


3900 


gcctggggtc 


tcgcggggcg 


cagcagtcag 


gtcgagggtg 


cagcaggagg 


ggagtcctga 


3960 


cgggcaggtc 


cctctttccc 


ctggtgcgca 


acactggttg 


gtagcttttg 


cggaggtggt 


4020 


gaagaagggc 


aggaggcctg 


ttgagcggag 


gagtccgggg 


atccctaatt 


atgtgacagg 


4080 


agaccctttc 


cagttcggcc 


tgtggcccat 


ccctctctca 


ccgccggcag 


attggagtct 


4140 


gctctcgggg 


agcccccagc 


cttttctttt 


taacagaggg 


caaaggggcg 


acggcgagag 


4200 


cacagatggc 


ggctgcggag 


ccggggaggc 


ggcggggaga 


cgcgcgggac 


tcgtggggag 


4260 


ggctggcagg 


gtgcaggggt 


tccgcgtgac 


ctgcccggct 


cccaggcatc 


gggctgggcg 


4320 


ctgcagttta 


ccgatttgct 


ttcgtccctc 


gtccaggttt 


aggagacgcg 


tggggacagc 


4380 


cgagccgcgc 


cgggcccctg 


gacggcgtcg 


ccaaggagct 


gggatcgcac 


ttgctgcagg 


4440 


tagagcggcc 


tcgccggggg 


aggagcgcag 


ccgccgcagg 


ctcccttccc 


accccgccac 


4500 


cccagcctcc 


aggcgtccct 


tccccaggag 


cgccaggcag 


atccagaggc 


tgccgggggc 


4560 


tggggatggg 


gtggtcccca 


ctgcggaggg 


atggacgctt 


agcatgtcgg 


atgcggcctg 


4620 


cggccaaccc 


taccctaacc 


ctacagccca 


cccggataac 


cagaacttgg 


tgaggcctcc 


4680 


gggctcttgc 


ttggtttgga 


gccaggtgct 


tagcgccccg 


agcccggggc 


cattcaccct 


4740 


gcaggagctg 


cacgcgcccc 


tgacctcggc 


ttttccctgg 


cagcagaggg 


gctttgcggg 


4800 


tcggccgggt 


agccctgagc 


acagctcgcc 


acttccaggt 


gggctgttgg 


cgctggctgg 


4860 


ggacacatcc 


cgatctttca 


aatgcccttt 


acagagcctc 


atcaacgacc 


cgattcattc 


4920 


ccccctcctg 


tcatttgtct 


ctgccatcga 


aaaatgccta 


ccgagagctg 


ctctgcattt 


4980 


ccgccctcta 


ttttgtgttt 


tactttaaaa 


taataataaa 


aaaaatgttg 


gctgcaggac 


5040 


gccatgactt 


aggtcagcga 


gtcagccgct 


agctctgcat 


ttccaaaaag 


cagatctttt 


5100 


cacaactctc 


ttgccccaag 


tgccctggtg 


tggtttattt 


tttaaaatgc 


atgcctgcgg 


5160 
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aagagaagac 


ccggggaata 


ttcgaaaccc 


cgagctttta caacataaag 


cgcatggtgt 


5220 


ggccgcggcg 


agtaatggcg 


ctctgggagc 


cctgcccagg cggcctctgc 


tcgccctcct 


5280 


ccacttccag 


ctccgagctg 


ggtgtgttgc 


aagtttcata ctcctacata 


ttataagtga 


5340 


cactaatatc 


agggacaact 


aagtgctggg 


gaacttcaat gaaaacctgg 


ctggtaaagt 


5400 


caacaccccc 


agacttctct 


gtgctacatt 


tctttaatta attccggagt 


ggtgtgtgga 


5460 


cgggcgtctt 


tgcagttatt 


atacacgtaa 


gtgaattagg ccatttgaag 


ctacgaagtc 


5520 


atacccaaca 


ttttccatta 


agaatattat 


ttttttagct actgctggca 


acttttagaa 


5580 


tttaattatg 


ataattttcc 


tcttttcctc 


attatcccag atatggctgg 


ttgtgagata 


5640 


ctttttcact 


aaatgtgtct 


ctttaatgat 


tttggaatta agcaagcatg 


ccaaatgcgc 


5700 


caagacattt 


ataactttag 


aaattgctgt 


atagtatata tttttggaac 


accacaggtt 


5760 


tagttgggaa 


aatattttgc 


agctgagtta 


gaaacttgaa agttaggctt 


ataatcaaga 


5820 


tgctgatttt 


caaccttagc 


atcggggaag 


gtaatgatag tttagttggc 


aaagactttt 


5880 


tgcagcaaac 


tgtatttgag 


acagcagaat 


ccaaggatat ctttcaagat 


tcacttatac 


5940 


tacattcttt 


ttagccccct 


ctctaggggt 


ggagggggtg gcttagaaaa 


accaaaggta 


6000 


atctggtttc 


aattacatgc 


tgtaaaaata 


gaatttgtgg ccagaaatta 


atttggaata 


6060 


ttttttatgg 


gggcaacatt 


gtgggttgta 


tgagtctttc accaacttta 


ttgcttttct 


6120 


ttggttctgg 


atctaaaata 


tgaatgagta 


aataaaatac agtttccttt 


ttcaaaaata 


6180 


atttagtctt 


tttatccttg 


acacataaat 


ataatcatta cctttttatt 


tgtcttaatt 


6240 


acctcttata 


gttttacttt 


atttttgagg 


gttagccaac ttaggaaaaa 


tcagaatttc 


6300 


atttgggtag 


ttttcatttt 


tcagaagttg 


atacttttcc tccttgaatt 


ttgcaaaggt 


6360 


tatttgatct 


cttgcaaatt 


ctaaataatt 


aattctgtag acatagtctg 


gtaaaataac 


6420 


caggtatttt 


gttttatttt 


ctgggcagtg 


gtagtgttcg gaatttatta 


ttccaacggt 


6480 


acttagaggt 


aactagaaaa 


ttgtggcaaa 


taaagcatat gtcctagtgg 


tataaatgac 


6540 


tgccaattat 


gacataaaat 


ggtctatata 


gtggtagatt ttcccctaag 


taccatagaa 


6600 


ctccggattt 


gtactaagtg 


ctaaaataac 


cccaaacaat aaagctacta 


cgtttaacct 


6660 


gagcttgata 


ggaattctca 


aatgttttgc 


atgatttttg tgatgtggat 


gtatgattgt 


6720 


ttctaaagtg 


ctttaaaagc 


atttcagagc 


ctggcctggg aactaaaccc 


tgggtgctgt 


6780 


ttcctacagt 


cctttttaag 


accctgggct 


tcatttttgc atttagcaag 


tatgtgttga 


6840 


tggtcatatt 


tgatgcatcc 


ctccatccca 


gtgactgtac cattgttaga 


tttaactcta 


6900 


atgtagaatg 


atcatctctt 


attctgacac 


tttatctttt acagactttg 


gatggatttg 


6960 


tttttgtggt 


agcatctgat 


ggcaaaatca 


tgtatatatc cgagaccgct 


tctgtccatt 


7020 


taggcttatc 


ccaggtgggt 


attgcctaat 


tttatgtgca accaaaatat 


taaacgaagt 


7080 


gacagcagat 


tttgacagcc 


tcacccaact 


tgaaaatgag tctgtttaag 


tgtttgcttt 


7140 
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tgtgtaagaa 


cataacccta 


aattgcaaaa 


tcctcttctc aggacatcct gcaggggttt 


tcttctccat 


attacgtttt 


acttttttac 


tttctttttc ccttggggtt tcatgaccag 


tgtgtcttca 


ccccgtcagc 


cttgttccga 


gattctgcca aaacccggac tatctgctta 


tgtagtaatg 


ctagatgccc 


tgaaaagagc 


tccacaaagg gacatgtttc agtccacact 


tgctggtctg 


cacgttacgg 


tattatgagt 


tgctataaag caagaggtaa gctgatgtgg 


aacatggatt 


ttatgcagaa 


tgttctttgg 


ggagagagaa 


tggggagggg tagatttatg 


agtgttcgca 


tggaattcgg 


ggtcattttt 


cctgactttt 


tccagttcag tggtcgtttc 


agcaaagcat 


tttatatgct 


tgtgatagaa 


actcataaaa 


ctcagtttac tactgttttg 


aacagcaagg 


tttctttaga 


acatctgaat 


attaccaatt 


tttctggaaa gatggaatca 


tcttcttaaa 


aaagtccacc 


gtttccttag 


aggaaagagg 


gggactcagc tttgtctctc 


cctggcaagc 


caagggtttg 


atgggggatt 


tgggggctcc 


ctgggcccat tgaatgtctg 


aaaggggata 


gtgtaactga 


tcattctcat 


tcacatgcct 


gacttaacaa atgtggatca 


tgattttaaa 


agtcacattt 


taaactacat 


atagggacta 


aatatttctt cacattgtgt 


actatcagag 


tgatttttca 


ccgtgactga 


agacacacta gaattaatga tagatacaga 


ggatgggttc 


cagagaggcc 


catattgaat 


ttgaattaca 


cctgcctcct aatgtagtta 


agccaaaata 


aaacatgccg 


attaatgtgg 


cttcagtttg ctcattgaaa aacaaaatga 


ccacatttgg 


caaactgcat 


ctggtgtatt 


ccttttattt ttctgccaat actttttttt 


ataagcttgc 


tatttattta 


catccttctc 


ttctgatgta tggatatcct tatccagatc 


cacatgggtt 


tatgtaactc 


tgtgtttctc 


aggcaaaacc 


ctttaaaatg aattacatat 


taaaatgata 


tggaag 









<210> 4 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 4 

tgggtgtgtt gcaagtttca tac 

<210> 5 

<211> 23 

<212> DNA 

<213> Artificial sequence 



7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8296 



23 



<220> 
<223> 



Single strand DNA oligonucleotide 



<400> 5 

ctttaccagc caggttttca ttg 



<210> 6 
<211> 22 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 6 
ccttggattc 


tgctgtctca 


aa 








22 


<210> 7 

<211> 3885 

<212> DNA 

<213> Homo sapiens 












<400> 7 
ggggctccgc 


gggcctggag 


cacggccggg 


tctaatatgc 


ccggagccga 


ggcgcgatga 


60 


aggagaagtc 


caagaatgcg 


gccaagacca 


ggagggagaa 


ggaaaatggc 


gagttttacg 


120 


agcttgccaa 


gctgctcccg 


ctgccgtcgg 


ccatcacttc 


gcagctggac 


aaagcgtcca 


180 


tcatccgcct 


caccacgagc 


tacctgaaga 


tgcgcgccgt 


cttccccgaa 


ggtttaggag 


240 


acgcgtgggg 


acagccgagc 


cgcgccgggc 


ccctggacgg 


cgtcgccaag gagctgggat 


300 


cgcacttgct 


gcagactttg 


gatggatttg 


tttttgtggt 


agcatctgat 


ggcaaaatca 


360 


tgtatatatc 


cgagaccgct 


tctgtccatt 


taggcttatc 


ccaggtggag 


ctcacgggca 


420 


acagtattta 


tgaatacatc 


catccttctg 


accacgatga 


gatgaccgct 


gtcctcacgg 


480 


cccaccagcc 


gctgcaccac 


cacctgctcc 


aagagtatga 


gatagagagg 


tcgttctttc 


540 


ttcgaatgaa 


atgtgtcttg 


gcgaaaagga 


acgcgggcct 


gacctgcagc 


ggatacaagg 


600 


tcatccactg 


cagtggctac 


ttgaagatca 


ggcagtatat 


gctggacatg 


tccctgtacg 


660 


actcctgcta 


ccagattgtg 


gggctggtgg 


ccgtgggcca 


gtcgctgcca 


cccagtgcca 


720 


tcaccgagat 


caagctgtac 


agtaacatgt 


tcatgttcag 


ggccagcctt 


gacctgaagc 


780 


tgatattcct 


ggattccagg 


gtgaccgagg 


tgacgggtta 


cgagccgcag 


gacctgatcg 


840 


agaagaccct 


ataccatcac 


gtgcacggct 


gcgacgtgtt 


ccacctccgc 


tacgcacacc 


900 


acctcctgtt 


ggtgaagggc 


caggtcacca 


ccaagtacta 


ccggctgctg 


tccaagcggg 


960 


gcggctgggt 


gtgggtgcag 


agctacgcca 


ccgtggtgca 


caacagccgc 


tcgtcccggc 


1020 


cccactgcat 


cgtgagtgtc 


aattatgtac 


tcacggagat 


tgaatacaag 


gaacttcagc 


1080 


tgtccctgga 


gcaggtgtcc 


actgccaagt 


cccaggactc 


ctggaggacc 


gccttgtcta 


1140 


cctcacaaga 


aactaggaaa 


ttagtgaaac 


ccaaaaatac 


caagatgaag acaaagctga 


1200 


gaacaaaccc 


ttacccccca 


cagcaataca 


gctcgttcca 


aatggacaaa 


ctggaatgcg 


1260 


gccagctcgg 


aaactggaga 


gccagtcccc 


ctgcaagcgc 


tgctgctcct 


ccagaactgc 


1320 


agccccactc 


agaaagcagt 


gaccttctgt 


acacgccatc 


ctacagcctg 


cccttctcct 


1380 


accattacgg 


acacttccct 


ctggactctc 


acgtcttcag 


cagcaaaaag 


ccaatgttgc 


1440 


cggccaagtt 


cgggcagccc 


caaggatccc 


cttgtgaggt 


ggcacgcttt 


ttcctgagca 


1500 


cactgccagc 


cagcggtgaa 


tgccagtggc 


attatgccaa 


ccccctagtg 


cctagcagct 


1560 
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cgtctccagc 


taaaaatcct 


ccagagccac 


cggcgaacac 


tgctaggcac agcctggtgc 


1620 


caagctacga 


agcgcccgcc 


gccgccgtgc 


gcaggttcgg 


cgaggacacc gcgcccccga 


1680 


gcttcccgag 


ctgcggccac 


taccgcgagg 


agcccgcgct 


gggcccggcc aaagccgccc 


1740 


gccaggccgc 


ccgggacggg 


gcgcggctgg 


cgctggcccg 


cgcggcaccc gagtgctgcg 


1800 


cgcccccgac 


ccccgaggcc 


ccgggcgcgc 


cggcgcagct 


gcccttcgtg ctgctcaact 


1860 


accaccgcgt 


gctggcccgg 


cgcggaccgc 


tggggggcgc 


cgcacccgcc gcctccggcc 


1920 


tggcctgcgc 


tcccggcggc 


cccgaggcgg 


cgaccggcgc 


gctgcggctc cggcacccga 


1980 


gccccgccgc 


cacctccccg 


cccggcgcgc 


ccctgccgca 


ctacctgggc gcctcggtca 


2040 


tcatcaccaa 


cgggaggtga 


cccgctggcc 


gcccgcgcca 


ggagcctgga cccggcctcc 


2100 


cggggctgcg 


gcgccaccga 


gcccggcaaa 


tgcgcacgac 


ctacattaat ttatgcagag 


2160 


acagctgttt 


gaattggacc 


ccgccgccga 


cttgcggatt 


tccaccgcgg aggccccgcg 


2220 


cgccggtgcc 


gagggccgag 


gagcgcccgg 


gtccgggcag 


gtgaccgccc gcctctgtcc 


2280 


tgcgagggcc 


ggtgcgaccc 


agttgctggg 


ggcttggttt 


cctcaccttg aaatcgggct 


2340 


tcacgcgtct 


tgccttgtcc 


ccaacgttcc 


acaacagtcc 


cgctggggga ttgaagcggt 


2400 


ttcactccgc 


aaatatcctc 


cactttcagg 


agggaaaacc 


caccctacca cagtccgctc 


2460 


ttccaagtgg 


acggcagacc 


tgggagggga 


cgcctgtgtc 


acgagccctt ttagatgctt 


2520 


aggtgaaggc 


agaagtgatg 


attgtaagtc 


ccatgaatac 


acaactccac tgtctttaaa 


2580 


agtcattcaa 


gagtctcatt 


atttttgttt 


ttatttaacc 


ctttcttcaa tacaaaaagc 


2640 


caacaaacca 


agactaaggg 


ggtgaccatg 


caattccatt 


ttgtgtctgt gaacataggt 


2700 


gtgcttccca 


aatacattaa 


caagctctta 


cttcccccta 


acccctatga actcttgata 


2760 


acaccaagag 


tagcaccttc 


agaatatatt 


gaataggcat 


taaatgcaaa aatatatatg 


2820 


tagccagaca 


gtttatgaga 


atgaccctgt 


caagcttcat 


tattacgtgg caaaatccct 


2880 


ctggcccaca 


cagatctgta 


attcactagg 


ctcgtgtttg 


ctacaaatag tgctaataaa 


2940 


gttaaattgc 


acgtgcaata 


cggaacactg 


tcaatggact 


gcaccttgtg aaggaaaaac 


3000 


atgcttaagg 


gggtgtaatg 


aaaatgatgt 


agacatttta 


agcattttct acacagcgag 


3060 


aaaacttcgt 


aagaacatgt 


tacgtgtgca 


acaggtaaac 


agaaatcctt tcataaagca 


3120 


ccagcagtgt 


ttaaaaaatg 


agcttccatt 


aatttttact 


ttttatgggt tttgcttaaa 


3180 


gatctcaaca 


tggaaaaatc 


ctgtcatggc 


tctgaactgc 


acaatgcatt gaaccgccgt 


3240 


ccttcaattt 


tcttcacact 


atcaacactg 


cagcattttg 


ctgctttatc aaaatggttt 


3300 


attttaggaa 


actttttcca 


cctttctgaa 


tggaaagagg 


ttttcacaaa tgttttaaac 


3360 


tcatcgttct 


aaaatcaagt 


gcacctacac 


caactgctct 


caaaatgtga actgactttt 


3420 


tttttttttt 


ttttgccaac 


cctgtgtcac 


ttagtgagga 


cctgacacaa tccctacagg 


3480 


gtgtctgtca 


gtgggcctca 


tggtaagagt 


cacaatttgc 


aaatttagga ccgtgggtca 


3540 
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tgcagcgaag 


9 

gggctggatg gtaggaaggg atgtgcccgc 


ctctccacgc 


actcagctat 


3600 


acctcattca 


cagctccttg 


tgagtgtgtg 


cacaggaaat 


aagccgaggg 


tattattttt 


3660 


ttatgttcat 


gagtcttgta 


attaaaccgt 


gattcttgaa 


aggtgtaggt 


ttgattacta 


3720 


ggagatacca 


ccgacatttt 


tcaataaagt 


actacaaaat 


gcttttgtgt ctaccttgtt 


3780 


attaactttt 


ggggctgtat 


ttagtaaaaa 


taaatcaagg 


etateggage agttcaataa 


3840 


caaaggttac 


tgttgagaaa 


aaagacccta 


tcatagattt 


acaag 




3885 


<210> 8 

<211> 2823 

<212> DNA 

<213> Homo sapiens 












<400> 8 
y y gy l. ttcg c 


gggcctggag 


cacggccggg 


fcctaatatcrc 


ccggagccga 


ggcgcgatga 


60 


dyy dyaay 


caagaatgcg gccaagacca 


y y ^*y y y ~y qq 


ggaaaatggc 


gagttttacg 


120 


ay ty u^ad. 


gctgctcccg 


ctgccgtcgg 


ppai~papt"1~p 


gcagctggac 


aaagegtcca 


180 


4- o ^ 4~ »-» j-r « *** +- 


caccacgagc 


tacctgaaga 


+■ err* cr net p p cr fc 


cttccccgaa 


ggtttaggag 


240 


a f» /"» it 4" rr/t/rrr 

auyuy Lyyyy 


acagccgagc 


cgcgccgggc 


p p p "1" cr ci a p er cr 


cgtcgccaag 


gagctgggat 


300 


f^rrr* a f t~ +~ cr c' f~ 


gcagactttg 


gatggatttg 


tttttatoat 


agcatctgat 


ggcaaaatca 


360 


uy LdLaLaL-u 


cgagaccgct 


tctgtccatt 


"baCTcrcttafcc 

i^ciy y v ^ 


ccaggtggag 


ctcacgggca 


420 


2» ^y4~ SS 4™ 4" 4~ o 

CLL-Ciy LaL Cud 


tgaatacatc 


catccttctg 


a c*p a e era t* era 


gatgaccget 


gtcctcacgg 


480 


ccca.cca.gcc 


gctgcaccac 


cacctgctcc 


aanaer4-a1~era 


gatagagagg 


tegttcttte 


540 


4" 4~ >T ^ ^ 4- Q 

LtCyaatyaa 


atgtgtcttg 


gcgaaaagga 


a prrpcrrrer prt/ 


gacctgcagc ggatacaagg 


600 


tcatccactg 


cagtggctac 


ttgaagatca 


crcrcacftatat 


gctggacatg 


tccctgtacg 


660 


actcctgcta 


ccagattgtg 


gggctggtgg 


cccftcrcracca. 


gtcgctgcca 


cccagtgcca 


720 


tcaccgagat 


caagctgtac 


agtaacatgt 


tcatgttcag 


ggccagcctt 


gacctgaagc 


780 


tgatattcct 


ggattccagg 


gtgaccgagg 


tcraccrereftfca. 

\— y uwm y y ^— 


cgagccgcag 


gacctgatcg 


840 


agaagaccct 


ataccatcac 


gtgcacggct 


rrp era pert" rr 4-4- 
y vy u^y uy i_ u 


ccacctccgc 


tacgcacacc 


900 


acctcctgtt 


ggtgaagggc 


caggtcacca 


l^loClGiy Uu 1 — U>CL 


ccggctgctg tecaageggg 


960 


gcggctgggt 


gtgggtgcag 


agctacgcca 


p p fT 4- rrrff- ct pa 
t—i^y Lyy Ly ua 


caacagccgc tcgtcccggc 


1020 


cccactgcat 


cgtgagtgtc 


aattatgtac 


\"ca ccs ci a era 4- 
Luauyyayci v» 


tgaatacaag gaacttcagc 


1080 


tgtccctgga 


gcaggtgtcc 


actgccaagt 


cccaggactc 


ctggaggacc 


gecttgtcta 


1140 


cctcacaaga 


aactaggaaa 


ttagtgaaac 


ccaaaaatac 


caagatgaag 


acaaagctga 


1200 


gaacaaaccc 


ttacccccca 


cagcaataca 


gctcattcca 


aatggacaaa 


ctggaatgcg 


1260 


gccagctcgg 


aaactggaga 


gccagtcccc 


ctgcaagcgc 


tgctgctcct 


ccagaactgc 


1320 


agccccactc 


agaaagcagt 


gaccttctgt 


acacgccatc 


ctacagcctg 


cccttctcct 


1380 


accattatgg 


acacttccct 


ctggactctc 


acttcttcag 


cagcaaaaag 


ccaatgttgc 


1440 
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cggccaagtt cgggcagccc caaggatccc cttgtgaggt ggcacgcttt ttcctgagca 1500 

caatgccagc cagcggtgaa tgccagtggc attatgccaa ccccctagtg cctagcagct 1560 

cgtctccagc taaaaatcct ccagagccac cggcgaacac tgctaggcac agcctggtgc 1620 

caagctacga aggtgggtca ggtctgctcg tggggaaggt gggaggactg cgcacggccg 1680 

ggagccgaag cagccatggc ggtgggtggc agatggagac agaaccctca cgctttgggc 1740 

aaacttgccc tctttctgct tctaagtagg gcttgctgtg ctttcttgct ctcaatgcag 1800 

gtgctcctcg agagtgagaa atggcagtct gcctgcctcg gggacactag tgacagtata 1860 

aagggcaaag gaaaaccgag tatctggcct tcacgtaaat cctggccaca ttcaccaacc 1920 

aaagggggac agtgattttc aaaaccagct cccatgtgct gagaacaccc cagctgcatt 1980 

tcttttgcaa gattcctttc cactccaacc agaagtgaat atttgagaca aacggcctat 2040 

tggctatttt cccatgccag ttttggaagt ggggaaaact atggtggaaa tttgtgggct 2100 

tggggacaga aatgccactc accaacccag ggcaaagaac acaaaccctc caggcctcag 2160 

tttcttcacc tgtaaaatgg ggtgaagctg tgatgtgcct actcccaagg acacgacaca 2220 

cagtagggac ctgccctgta catgctagtt caacagaaag gaatggcctt tcaccttctc 2280 

ctggtggcag gcaagcagat gtcctctgcg gagataccgc cagctcccca ggacgcagac 2340 

tgactcctgt ttgctcgctg gaccaacccc aggcagaagg tggaaggtgg gaacagaggt 2400 

ttagctgcag gacatgtatt cccattgcac cgagacctaa ctgccgctca gagtgtagac 2460 

cgagatggtg cagatgcctg cagtgccatt aaaatgtggg tgaaggtgac atcaggatta 2520 

tgtgccccag gccgggctca gtggctcaca cctgtaatcc cagcactttg ggaggccaag 2580 

gtgggcggat cacctgaggt caggagtttg cgacaagcct gccaacaagc tgaaacccca 2640 

tctccactaa aaatacaaaa attagttggg catggtggtg agcacctgta atcccagcta 2700 

ctctggaggc tgagatagga ggatcacttg aacccgggag gtggaggttg cagtgagcta 27 60 
agatcacatc actgcactcc agcctgggta acagagtgag actgtctcaa aaaaaaaaaa 
aaa 

<210> 9 
<211> 99 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 9 

aggagctggg atcgcacttg ctgcagactt tggatggatt tgtttttgtg gtagcatctg 60 
atggcaaaat catgtatata tccgagaccg cttctgtcc 99 

<210> 10 

<211> 19 

<212> DNA 

<213> Artificial sequence 



2820 
2823 
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<220> 

<223> Single strand DNA oligonucleotide 



<400> 10 

ggagctggga tcgcacttg 



19 



<210> 11 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 11 

ggacagaagc ggtctcggat a 21 



<210> 12 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<210> 13 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 13 

gtggctactt gaagatcagg ca 22 



<210> 14 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<210> 15 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 12 

tttacaacat aaagcgcatg gtg 



23 



<400> 14 

cctagcagct cgtctccagc 



20 



<400> 15 

ggtgtcctcg ccgaacct 



18 
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<210> 16 
<211> 21 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 
<400> 16 

tagggacctg ccctgtacat g 21 

<210> 17 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 17 

gctggcggta tctccgc 17 

<210> 18 
<211> 120 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 18 

cctagcagct cgtctccagc taaaaatcct ccagagccac cggcgaacac tgctaggcac 60 
agcctggtgc caagctacga agcgcccgcc gccgccgtgc gcaggttcgg cgaggacacc 120 

<210> 19 
<211> 101 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 19 

tagggacctg ccctgtacat gctagttcaa cagaaaggaa tggcctttca ccttctcctg 60 
gtggcaggca agcagatgtc ctctgcggag ataccgccag c 101 

<210> 20 
<211> 101 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 20 

tgggtgtgtt gcaagtttca tactcctaca tattataagt gacactaata tcagggacaa 60 
ctaagtgctg gggaacttca atgaaaacct ggctggtaaa g 101 
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<210> 21 
<211> 20 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 
<400> 21 

ctggcaagca gctggaagat 



<210> 22 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 22 

tttcttagca ccaccacgaa gtc 



<210> 23 
<211> 101 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 23 

ctggcaagca gctggaagat ggacgtactt 
ctcttcatct tgtgttgaga cttcgtggtg 



tgtctgacta caatattcaa aaggagtcta 
gtgctaagaa a 



<210> 24 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 24 

cagtgattat aggctttcgc tctaa 



25 



<210> 25 

<211> 23 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 25 

cagggctatt ggttgaatga gta 



<210> 26 

<211> 134 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Real time PCR amplicon 



<400> 26 

cagtgattat aggctttcgc tctaagatta aaaatgccct agcccacttc ttaccacaag 



60 



gcacacctac accccttatc cccatactag ttattatcga aaccatcagc ctactcattc 



120 



aaccaatagc cctg 



134 



<210> 27 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 27 

gtaacccgtt gaaccccatt 20 



<210> 28 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<210> 29 
<211> 151 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 29 

gtaacccgtt gaaccccatt cgtgatgggg atcggggatt gcaattattc cccatgaacg 60 
aggaattccc agtaagtgcg ggtcataagc ttgcgttgat taagtccctg ccctttgtac 120 
acaccgcccg tcgctactac cgattggatg g 151 



<210> 30 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 



<400> 28 

ccatccaatc ggtagtagcg 



20 



<400> 30 

tgagagtgat tcgcgtggg 



19 



<210> 31 
<211> 21 
<212> DNA 



<213> Artificial sequence 
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<220> 

<223> Single strand DNA oligonucleotide 
<400> 31 

ccagggtacg aggctttcaa t 



<210> 32 
<211> 91 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 32 

tgagagtgat tcgcgtgggt acccgcaaga 
tggtggcaac attgaaagcc tcgtaccctg 



gccagcttgc tcgcatacag acggacagtg 



<210> 33 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 33 

tgacactggc aaaacaatgc a 

<210> 34 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

<400> 34 

ggtccttttc accagcaagc t 

<210> 35 
<211> 94 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Real time PCR amplicon 
<400> 35 

tgacactggc aaaacaatgc agactttgct ttccttggtc aggcagtata atccaaagat 60 
ggtcaaggtc gcaagcttgc tggtgaaaag gacc 94 



<210> 36 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Single strand DNA oligonucleotide 



+ 
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<400> 36 

tgggaacaag agggcatctg 20 

<210> 37 

<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Single strand DNA oligonucleotide 

22 



<400> 


37 


ccaccactgc atcaaattca tg 


<210> 


38 


<211> 


86 


<212> 


DNA 


<213> 


Artificial sequence 


<220> 




<223> 


Real time PCR amplicon 


<400> 


38 



tgggaacaag agggcatctg ctaaagtttc agattccatt tctgctcagt atccagtagt 60 
ggatcatgaa tttgatgcag tggtgg / 86 

<210> 39 

<211> 160 

<212> PRT 

<213> Homo sapiens 

<400> 39 

Leu Asp Gly Phe Val Phe Val Val Ala Ser Asp Gly Lys lie Met Tyr 
15 10 15 

lie Ser Glu Thr Ala Ser Val His Leu Gly Leu Ser Gin Val Glu Leu 
20 25 30 

Thr Gly Asn Ser lie Tyr Glu Tyr lie His Pro Ser Asp His Asp Glu 
35 40 45 

Met Thr Ala Val Leu Thr Ala His Gin Pro Leu His His His Leu Leu 
50 55 60 

Gin Glu Tyr Glu lie Glu Arg Ser Phe Phe Leu Arg Met Lys Cys Val 
65 70 75 80 

Leu Ala Lys Arg Asn Ala Gly Leu Thr Cys Ser Gly Tyr Lys Val lie 
85 90 95 

His Cys Ser Gly Tyr Leu Lys He Arg Gin Tyr Met Leu Asp Met Ser 
100 105 HO 



Leu Tyr Asp Ser Cys Tyr Gin He Val Gly Leu Val Ala Val Gly Gin 
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115 120 125 



Ser Leu Pro Pro Ser Ala He Thr Glu He Lys Leu Tyr Ser Asn Met 
130 135 140 



Phe Met Phe Arg Ala Ser Leu Asp Leu Lys Leu lie Phe Leu Asp Ser 
145 150 155 160 



<210> 40 

<211> 178 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Met Lys Glu Lys Ser Lys Asn Ala Ala Lys Thr Arg Arg Glu Lys Glu 
1 5 10 15 



Asn Gly Glu Phe Tyr Glu Leu Ala Lys Leu Leu Pro Leu Pro Ser Ala 
20 25 30 



lie Thr Ser Gin Leu Asp Lys Ala Ser He lie Arg Leu Thr Thr Ser 
35 40 45 



Tyr Leu Lys Met Arg Ala Val Phe Pro Glu Gly Glu Ala Ser Gly Gly 
50 55 60 



Arg Pro Gly Thr Leu Gly Ser Pro Ala Ala Pro Ala Gin Ala Gly Ser 
65 70 75 80 



Ala Ser Gin Pro Ala Gin Arg Gly Cys Arg Gly Leu Ala Ser Arg Ala 
85 90 95 



Gly Ala Ser Glu Gly Gly Cys Val Arg Val Phe Gly Phe Gly Ala Gly 
100 105 110 



Leu Gly Arg Gly Ala Arg Ala Leu Ala Ala Gin Ala Thr Lys Pro Ser 
115 120 125 



Pro Gly Pro Gly Leu Gly Glu Gly Glu Leu Arg He Val Pro Gly Ala 
130 135 140 



Gly Ser Pro Pro Ala Arg Thr Ala Ser Glu Arg Cys Glu Ser Ala Gly 
145 150 155 160 



He Thr Val Arg Pro Lys His Cys Arg Leu Arg Pro Gin Ser Glu His 
165 170 175 



Leu Cys 



<210> 41 
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<211> 584 
<212> PRT 
<213> Homo sapiens 

<400> 41 

Met Arg Arg Trp Leu Pro Phe Pro Pro Phe Leu Pro Gly Gly Pro Ser 
is 10 15 



Asn Leu Ala He Pro Asp Phe Gin Gin His Ser Val Pro Thr Gly Pro 
20 25 30 

Ala Val Leu Cys Arg Gly Ala Gly Ser Ser Pro Pro Pro Ser Arg Asn 
35 40 45 

Leu Thr Phe Leu Thr Pro Thr Pro Pro Ala Gly Asn His Arg Leu Gly 
50 55 60 

Trp Gly Pro Leu Val Leu Gin Glu Ser Ser Ala Asp Phe Ser Pro Ser 
65 ™ 75 80 

Leu Pro Ala Met Gly Phe Ala Arg Ser Gin Gly Pro Glu Ala Thr Leu 
85 90 95 

Thr Val Asn Thr Ala Gin Gly Lys Gly Gly Ser Arg Thr Thr Ala Gin 
100 105 HO 

Pro Gin Glu Arg Pro Ser Ser Arg Glu Lys Asp Ser Cys Gin Gly Ala 
115 " 120 125 

Phe Val Pro Arg Pro Ser Tyr Pro Ala Leu Pro Gly Gly Ser Cys Leu 
130 135 140 

Trp Phe Ser Ala Arg Gly Thr Thr Ala Leu Arg Thr Ala Val Gly Gly 
145 150 155 160 

Trp Gly Trp Gly Trp Ala Trp Pro Ser Val Gly Trp Gly Gly Gly Arg 
165 170 175 

Ala Ser Ser Ala Arg Ser Ser Leu Arg Arg Gly Arg Ser Gin Gly Ser 
180 185 190 

Gly Ala Arg Leu Val Gly Arg Glu Thr Pro Gly Ala Leu Ser Ser Gly 
195 ' 200 205 

Glu Val Gly Val Gin Ala Gly Lys Pro Gly Val Ser Arg Gly Ala Ala 
210 215 220 

Val Arg Ser Arg Val Gin Gin Glu Gly Ser Pro Asp Gly Gin Val Pro 
225 230 235 240 



Leu Ser Pro Gly Ala Gin His Trp Leu Val Ala Phe Ala Glu Val Val 
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245 250 255 



Lys Lys Gly Arg Arg Pro Val Glu Arg Arg Ser Pro Gly lie Pro Asn 
260 265 270 



Tyr Val Thr Gly Asp Pro Phe Gin Phe Gly Leu Trp Pro lie Pro Leu 
275 280 285 



Ser Pro Pro Ala Asp Trp Ser Leu Leu Ser Gly Ser Pro Gin Pro Phe 
290 295 300 



Leu Phe Asn Arg Gly Gin Arg Gly Asp Gly Glu Ser Thr Asp Gly Gly 
305 310 315 320 



Cys Gly Ala Gly Glu Ala Ala Gly Arg Arg Ala Gly Leu Val Gly Arg 
325 330 335 



Ala Gly Arg Val Gin Gly Phe Arg Val Thr Cys Pro Ala Pro Arg His 
340 345 350 



Arg Ala Gly Arg Cys Ser Leu Pro lie Cys Phe Arg Pro Ser Ser Arg 
355 360 365 



Phe Arg Arg Arg Val Gly Thr Ala Glu Pro Arg Arg Ala Pro Gly Arg 
370 375 380 



Arg Arg Gin Gly Ala Gly lie Ala Leu Ala Ala Gly Arg Ala Ala Ser 
385 390 395 ~ " 400 



Pro Gly Glu Glu Arg Ser Arg Arg Arg Leu Pro Ser His Pro Ala Thr 
405 410 415 



Pro Ala Ser Arg Arg Pro Phe Pro Arg Ser Ala Arg Gin lie Gin Arg 
420 425 430 



Leu Pro Gly Ala Gly Asp Gly Val Val Pro Thr Ala Glu Gly Trp Thr 
435 440 445 



Leu Ser Met Ser Asp Ala Ala Cys Gly Gin Pro Tyr Pro Asn Pro Thr 
450 455 460 



Ala His Pro Asp Asn Gin Asn Leu Val Arg Pro Pro Gly Ser Cys Leu 
465 470 475 480 



Val Trp Ser Gin Val Leu Ser Ala Pro Ser Pro Gly Pro Phe Thr Leu 
485 490 495 



Gin Glu Leu His Ala Pro Leu Thr Ser Ala Phe Pro Trp Gin Gin Arg 
500 505 510 
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Gly phe Ala Gly Arg Pro Gly Ser Pro Glu His Ser Ser Pro Leu Pro 
515 520 525 



Gly Gly Leu Leu Ala Leu Ala Gly Asp Thr Ser Arg Ser Phe Lys Cys 
530 535 540 



Pro Leu Gin Ser Leu lie Asn Asp Pro lie His Ser Pro Leu Leu Ser 
545 550 555 560 



Phe Val Ser Ala lie Glu Lys Cys Leu Pro Arg Ala Ala Leu His Phe 
565 570 575 



Arg Pro Leu Phe Cys Val Leu Leu 
580 
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